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ABSTRACT 


In  this  dissertation,  we  investigate  the  use  of  a 
dynamic  priority  scheduling  rule  proposed  by  Kleinrock  in  a 
M/G/l  queueing  model  and  study  its  applicability  as  a 
scheduling  rule  for  transaction  processing  systems. 

In  this  rule,  each  job  has  a priority  index  and  it 
increases  linearly,  starting  from  zero  when  the  job  arrives, 
at  a rate  which  is  assigned  to  the  job's  priority  class. 
When  the  server  becomes  free,  it  selects  the  job  for  service 
whose  priority  index  is  the  largest.  The  rates  for  the 
priority  classes  are  a set  of  parameters  that  can  be  varied 
to  control  the  waiting  times  of  jobs  in  each  priority  class. 

To  study  this  rule,  we  first  derived  the  feasible 
performance  space  of  mean  waiting  times  for  each  class  of 
jobs.  Then  we  derived  an  algorithm  for  determining  the 
values  of  the  control  parameters  for  any  given  set  of 
feasible  mean  waiting  times.  We  then  discuss  optimization 
of  several  cost  models  using  functions  of  mean  waiting 
times.  For  each  cost  function,  a procedure  to  obtain  an 
optimal  combination  of  feasible  mean  waiting  times  is 
presentee.  This  set  of  mean  waiting  times  can  then  be  used 
to  obtain  optimal  values  for  the  control  parameters. 


At  saturated  congestion,  we  proved  a multiplicative 
version  ot  Jackson's  conjecture  which  states  that  the  ratios 
among  different  classes  of  mean  waiting  times  are 
proportional  to  the  reciprocals  of  the  control  parameters. 

Simulation  studies  of  the  variance  behavior  show  that 
as  discrimination  among  different  classes  increases,  the 
variance  of  waiting  times  for  each  class  of  jobs  varies  and 
it  either  strictly  increases  or  strictly  decreases  following 
the  same  direction  as  the  mean  waiting  time  cnanges. 

A sensitivity  study  of  mean  waiting  times  shows  that 
the  relative  waiting  times  among  different  classes  of  jobs 
remains  fairly  steady  under  reasonable  fluctuations  of 
system  congestion. 

The  results  obtained  from  our  study  show  that  this 
scneduling  rule  is  applicable  for  use  in  transaction 
processing  systems.  The  capability  of  adjusting  the  control 
parameters  to  provide  mean  waiting  times  performance  between 
tnat  of  f ir st-come-f ir st-serve  and  fixed  priority  scheduling 
allows  the  designer  to  select  from  a wide  range  of  response 
times  for  different  classes  of  transactions. 
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CHAPTER  1 
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TRANSACTIONS  PROCESSING  SYSTEMS 

1 . (J  Introduction 

Over  the  past  tew  years,  a very  pronounced  trend  of 
increased  usage  of  on-line  computer  systems  has  developed  in 
data  processing.  One  type  of  on-line  system  that  is 
proliferating  in  use  is  Transaction  Processing  System. 
Transaction  processing  systems  generally  handle  an 
organization's  operational  data  as  the  data  are  generated 
and  needed  by  on-going  operations.  This  method  of 
processing  aata  reduces  clerical  data  handling,  gives  an 
almost  instant  turnaround  time  and  provides  current  and 
accurate  information  whenever  and  wnerever  needed. 

Simply  stated,  transaction  processing  systems  are  a 
special  type  of  computer  information  systems  designed  for 
non-expert  users  to  communicate  with  the  computers  for 
on-line  processing  of  transactions.  The  computer  handles 
transaction  workloads  by  running  a set  of  previously  stored 
application  programs  to  interact  with  a centrally  managed 
data  base.  Such  systems  are  found  in  a vast  variety  of 
organizations,  including  airline,  banking,  medical, 
insurance,  manufacturing  and  government. 

One  representative  application  of  transaction 
processing  systems  is  the  airline  reservation  system 
IKNIG72] . Initially  designed  solely  for  controlling  seat 


inventory  and  maintaining  limited  passenger  records 
[PERR61 ] , airline  systems  have  now  evolved  into  large, 
complex  system-  that  maintain  waiting  lists,  provide  flight 
information,  handle  provisions  for  special  facilities  such 
as  hotel  reservation  and  car  rental,  perform  load  and  trim 
calculations  prior  to  take-off,  and  even  interact  with  each 
other  for  inter-airline  flight  information. 

Since  transaction  processing  applications  usually 
involve  a large  volume  of  transactional  data  for  processing, 
scheduling  rules  that  provide  fast  response  times  and 
efficient  usage  of  computer  systems  resources  are  desi.rable. 

This  dissertation  discusses  a time  dependent  priority 
scheduling  rule  for  transaction  processing  systems  (TPS) 
which  we  call  escalating  priority  scheduling.  In  this 
scheduling  rule,  a set  of  control  parameters  can  be  adjusted 
by  the  designer  to  provide  discriminations  between  different 
priority  classes  of  transactions.  This  rule  was  first 
proposed  by  Kleinrock  [KLEI64]. 

In  investigating  this  scheduling  rule,  we  study  the 
behavior  and  limitations  of  the  rule,  derive  the  feasible 
performance  space  of  mean  waiting  times  allowed,  and  discuss 
some  cost  models  for  optimizing  this  scheduling  rule. 

In  optimizing,  our  solution  procedure  is  divided  into 
the  following  two  stages:  First,  for  a given  objective 
function,  we  determine  the  optimal  mean  waiting  times  for 


each  priority  class.  Second,  given  a desirable  performance 
of  mean  waiting  times  that  is  feasible,  we  determine  the 
values  of  tne  control  parameters  that  will  realize  this 
aesirable  performance. 

In  the  remainder  of  this  chapter.  Section  1.1  gives  an 
introduction  to  transaction  processing  systems,  Section  1.2 
discusses  the  modeling  of  transaction  processing  systems  as 
queueing  systems,  Section  1.3  reviews  and  summarizes 
priority  scheduling  rules  that  are  relevant  to  our  research, 
and  Section  1.4  defines  the  problems  specific  to  escalating 
priority  scheduling,  and  gives  a more  detailed  overview  of 
tne  dissertation  work. 

1 . 1 Transaction  Processing  Systems 

There  are  certain  common  characteristics  in  all 
transaction  processing  (TP)  environments.  They  are  the 
following  [BOOT72] : 

1.  The  system  is  highly  user  oriented.  Data  entry  is 
through  terminals  which  are  located  convenient  to 
the  users.  The  computer  for  processing 
transactions  may  be  installed  in  a different 
location. 

2.  The  terminal  operating  procedure  for  data  entry  is 
built  around  "non-expert"  users,  and  is  usually 
quite  simple. 
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3.  The  input  data  items,  called  transactions,  are  of 
pre-aefined  type.  Transaction  volume  is  usually 
large,  and  transaction  applications  may  be 
diversified. 

4.  Application  programs  to  process  the  transactions 
are  prepared  and  stored  in  the  computer  in  advance 
and  are  invisible  to  the  user. 

5.  The  response  time  is  fast.  The  system  is  capable 
of  providing  the  user  with  the  desired  information 
within  seconds. 

6.  The  system  generally  maintains  an  integrated  data 
base.  Processing  of  each  transaction  usually 
involves  some  manipulation  --  adding,  updating, 
retrieving  or  deleting  — of  data  in  the  data  base. 


For 

software 


transaction  processing  applications,  sophisticated 
is  designed  to  meet  the  following  needs: 


1.  Simple  and  effective  interactive  procedures  for  an 
efficient  flow  of  information  between  terminal 
operators  and  the  computer. 

2.  Effective  utilization  of  system  resources  to  ensure 
satisfactory  performance. 
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3.  Easy  and  fast  implementation  of  application 
programs  for  system  service  enhancements. 

4.  Protective  method  of  data  handling  to  ensure  data 
base  integrity  and  security. 

5.  Efficient  and  effective  error  recovery  procedure  to 
maintain  system  availability. 

The  current  state-of-the-art  software  systems  designed 
to  meet  this  need  are  known  by  several  different  names: 
transaction  (processing)  monitor,  teleprocessing  monitor, 
and  transaction  processing  (operating)  executive.  Some  of 
the  better  known  transaction  processing  monitors  that  are 
available  for  general  use  are:  CICS  from  IBM  [IBM73] , TIP 
from  Univac,  TPE  from  Honeywell  [HONE73]  , TASK/MASTER  from 
Turnkey  Systems,  SHADOW  II  from  Cullinane  Corporation, 
INTERCOMM  from  Informatics  Inc.,  and  ENVIRON/1  from  Cincom 
Systems,  Inc. 

These  software  systems  are  designed  with  a high  degree 
of  generality  so  that  users  can  tailor  the  system  toward 
their  specific  applications.  Generally,  these  systems  can 
perform  the  following  tasks  [IBM73]: 

. Terminal  management  - Host  and  control  data  transfer 
between  a telecommunication  network  of  heterogeneous 
mix  of  terminals  and  application  programs. 
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. Multitask  control  - Schedule  and  support  concurrent 
processing  of  a number  of  application  programs  for  the 
wide  mixture  of  transaction  workloads. 

. File  management  - Provide  efficient  access  methods  in 
nandling  data  base  files.  Support  scheduling  and 
initiation  of  all  file  item  requests  made  by  the 
application  programs. 

. Storage  management  - Allocate  and  control  of  storage  for 
programs,  storage  for  input/output  buffering  areas  and 
temporary  storage  of  data. 

. Program  management  - Provide  a multiprogramming 
capability  while  offering  a real-time  program  fetch 
capability.  Intercept  program  interrupts  to  prevent 
total  system  termination. 

. Error  handling  - Detect  and  handle  error  conditions 
caused  by  hardware  or  software  malfunction.  Provide 
dump  facility  to  assist  in  analysis  of  programs  and 
transaction  undergoing  development  or  modification. 

In  the  literature,  there  is  little  detailed 
documentation  of  transaction  processing  systems.  Topics  of 
some  existing  articles  discussing  transaction  processing 
systems  are:  a design  methodology  for  TPS  [HIRC7b] , TP 
facilities  in  IBM  IMS/VS  systems  [MCGE77 ] , development  of  a 
high  performance  TPS  and  its  monitor  [SIWI77],  designing 
customized  systems  using  TP  monitors  IDAVE74 ] , running  TP 
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monitor  in  a special  operating  system  environment  [EADE77], 
and  report  of  a successful  use  of  TP  monitors  [GERA76] . In 
this  last  article  which  describes  a magazine/  book/  record 
system  of  Time,  Inc.  using  CICS  of  IBM,  it  is  reported  that 
the  system  is  capable  of  supporting  200  CRT  terminals  to 
process  750,000  transactions  per  week  which  interact  with  a 
aata  base  of  five  billion  characters,  while  providing  a mean 
response  time  of  one  second. 

For  the  evaluation  and  selection  of  commercially 
available  transaction  processing  monitors,  Matheny  et  al. 
[MATH 7 7 ] has  compiled  a list  of  evaluation  criteria.  They 
are:  (a)  terminal  support,  (b)  internal  facilities 
consideration,  (c)  application  programming  considerations, 
(d)  communication  functions,  (e)  operating  environment,  (f) 
implementation  / maintenance,  (g)  recovery  / reliability  / 
controls,  and  (h)  costs.  They  have  also  provided  a 
comparative  study  of  the  internal  characteristics  of  the 
following  five  transaction  processing  monitors:  CICS, 
INTERCOMM,  TASK/MASTER,  ENVIRON/1  and  SHADOW  II. 

1 . 2 Moaeling  of  Transaction  Processing  Systems 

To  date,  there  are  a limited  number  of  documented 
studies  on  the  design  and  performance  evaluation  of 
transaction  processing  systems.  [GERK74 ] , [SCHW77 ] and 
LSARZ77]  are  three  of  the  articles  in  the  literature  that 
specifically  address  these  problems.  Computer  performance 
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evaluations  are  commonly  used  for  the  design,  selection  and 
tuning  (peaking)  of  computer  systems.  Modeling  of  computer 
systems  as  queueing  systems  is  a common  and  desired  approach 
for  tne  evaluation  of  systems  performance.  This  approach  of 
studying  computer  timesharing  systems  has  proved  to  be 
successful  in  improving  the  design,  control  and 
effectiveness  of  systems  operation  [KLEI76]  . 


Transaction  processing  systems  are  very  similar  to 
general  timesharing  systems.  Both  kinds  of  systems  are 
designed  for  on-line  use  and  man-machine  interactions 
require  similar  constraints  on  the  systems  responsiveness. 
However,  certain  cnaracter istics  that  distinguish  these  two 
types  of  systems  have  limited  the  usefulness  of  models  that 
were  aeveloped  for  timesharing  systems  in  the  study  of 
transaction  processing  systems. 


Some  of  the  characteristics  unique  to  transaction 
processing  systems  are  the  following: 


(a)  For  each  transaction  generated  from  the  terminal, 
the  required  application  ptogram(s)  for  processing  this 
transaction  are  stored  in  the  system  in  advance.  Hence  the 
general  requirements  of  system  resources  needed  to  process 
this  transaction  are  known. 


types 
vary . 


(b)  Based  on  the  intrinsic  characteristics  of  different 
of  transactions,  their  response  time  requirements  may 
That  is,  some  types  of  transactions  are  more 


8 


important,  and  need  to  be  processed  with  priority.  On  the 
other  hand,  other  types  of  transactions  may  be  delayed 
slightly  without  significantly  affecting  the  terminal 
operations . 

(c)  Due  to  the  nature  of  transactions,  processing  logic 
of  the  application  programs  are  generally  quite  straight- 
forward, and  the  processing  path  of  programs  generally  do 
not  change  with  different  values  of  transaction  data. 

(d)  The  processing  times  required  for  all  types  of 
transactions  are  usually  no  more  than  one  second.  Thus,  in 
a transaction  processing  environment,  processing  overhead 
becomes  important.  The  technique  of  swapping  programs  in 
and  out  of  memory,  which  is  widely  used  in  timesharing 
systems,  is  generally  not  applicable  in  transaction 
processing . 

In  studying  queueing  systems,  tne  following  essential 
elements  need  to  be  characterized: 

1.  Source  population  - population  from  which  the 
entities  demanding  service  emanate. 

2.  Arrival  process  - the  pattern  by  which  the  entities 
arrive  to  the  service  facility. 

i.  Queue  structure  - Number  and  configuration  of 
waiting  lines. 
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4.  Service  facilities  - Number  of  service  channels  and 
ordering  of  services. 

5.  Service  process  - The  required  time  to  completely 
service  an  entity  at  each  service  station. 

6.  Scheduling  discipline  - The  rule  by  which  units  are 
selected  for  service. 

We  now  discuss  each  of  these  characteristics  in  the 
transaction  processing  environments: 

(1)  Source  population.  In  many  TPS,  there  are  a large 
number  of  users.  In  this  case,  infinite  populations  can  be 
used  to  describe  tne  source  population  of  arrival.  However, 
some  TPS  may  only  have  a small  number  of  terminals.  Then,  a 
finite  arrival  population  is  needed  for  modeling  the 
queueing  arrival. 

(2)  Arrival  process.  In  general,  arriving  transactions 
from  different  terminals  are  independently  generated. 
Studies  of  timesharing  systems  have  shown  that  when  a large 
number  of  terminals  are  active  (signed  on) , then  Poisson 
arrivals  can  be  assumed  [ANDE74J.  Simulations  of  stochastic 
processes  have  shown  that  when  there  are  10  or  more  active 
terminals,  the  superimposed  arrival  process  is  sufficiently 
close  to  a Poisson  process  even  though  individual  arrivals 
from  each  terminal  may  not  be  Poisson  [AGRA75] . However, 
when  the  arrival  population  is  small,  some  other  process  may 
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be  needed  to  describe  the  arrival  process. 


(i)  Queue  structure.  Depending  upon  the  type  of 
computer,  tne  number  of  priority  classes,  and  the  scheduling 
rule,  different  queue  structures  are  required.  Generally, 
for  priority  scheduling,  it  is  desirable  to  maintain 
separate  queues  for  each  priority  class  for  tne  use  of  each 
system  resource.  Since  each  transaction,  once  generated, 
will  need  to  be  processed  eventually,  an  infinite  waiting 
line  should  be  allowed  in  TPS. 

(4)  Service  facilities.  In  uniprogramming  environ- 
ments, CPU  and  I/O  operations  are  alternatively  active. 
Thus  the  two  services  together  can  be  thought  of  as  a single 
service.  In  this  case,  TPS  can  be  modeled  as  single  stage 
service  facilities.  However,  if  multiprogramming  is 
allowed,  a model  with  multiservers  facilities  is  necessary. 

(5)  Service  process.  For  the  processing  of 
transactions  of  the  same  type,  resource  requirements  in 
terms  of  core  storage,  processing  path  length,  number  and 
distribution  of  data  base  access,  file  structure  usage,  and 
memory  overlay  structure  are  very  similar.  The  straight- 
forward logic  in  application  programs  takes  a limited  amount 
of  time  to  execute.  Thus,  for  each  type  of  transaction,  the 
service  time  distribution  may  have  a small  mean  and  possibly 
a small  variance.  However,  due  to  the  nature  of  diversified 
applications,  different  transaction  types  may  assume  wide 
variations  of  service  time  distributions.  It  is  thus 
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desiraole  to  have  a general  distr ioution  for  describing  the 
service  process. 

(6)  Scheduling  discipline.  In  view  of  the  nature  of 
different  transaction  applications,  it  is  generally 
desirable  to  use  priority  scheduling  rules  in  TPS  to  satisfy 
different  degrees  of  response  performance  to  different  types 
of  transactions. 

Thus,  TPS  can  be  modeled  as  multiclass  priority 
queueing  systems  with  Poisson  arrivals  and  general  service 
time  distributions.  The  model  queue  structure,  scheduling 
discipline,  whether  an  infinite  or  finite  source  population, 
and  whether  single  or  multi  server (s)  depend  upon  the 
computer  system  and  the  application  of  the  TPS  being 
modeled . 

In  this  dissertation,  we  will  limit  our  discussion  to  a 
single  processor,  uniprogramming  TPS  which  support  a large 
number  of  terminals  (as  is  usually  true  in  minicomputer 
transaction  processing  systems) . We  can  model  this  class  of 
TPS  as  a single  server  queueing  system  with  multiclass 
Poisson  arrivals  and  general  service  time  distributions.  In 
Kendall's  notation  IKEND51],  this  model  is  called  a 
multiclass  M/G/l  queueing  system.  (M/G/l  denotes  Poisson 
arrivals,  general  service  time  distribution  and  a single 
server.  The  letter  M denotes  the  Markovian  property  of 
Poisson  arrival.) 
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1 . 3 Review  and  Summary  of  Priority  Scheduling 


A scheduling  discipline  in  a queueing  system  is  a 
decision  rule  for  the  service  facility  to  choose  among  jobs 
in  the  queue  the  next  one  for  service.  When  we  refer  to  a 
scheduling  discipline  as  priority  scheduling,  it  is  assumed 
that  arriving  jobs  are  grouped  into  classes  based  on  some 
external  characteristics,  and  the  rule  of  scheduling  depends 
specifically  on  the  priority  class  to  which  a job  belongs. 
The  selection  decision  may  or  may  not  depend  on  other 
characteristics  of  the  system,  such  as  the  queue  length,  the 
arrival  pattern,  the  service  time  required,  or  service  so 
far  rendered. 


The  following  notation  will  be  used  throughout  this 
dissertation:  For  an  n-class  priority  queueing  system,  we 
let 


E [Si] 
E [S?] 


E [Wi] 


P 


M- 


Poisson  arrival  rate  for  class  i jobs. 

Random  variable  for  service  time  of  class  i jobs. 
Expected  service  time  for  class  i jobs. 

Second  moment  of  service  time  of  class  i jobs. 

1/E  [Si],  the  service  rate  for  class  i jobs. 
i.j/^i  , utilization  factor  for  class  i jobs. 
Expected  waiting  time  in  queue  for  class  i jobs. 
X1  + ^2  + • • * + *n  ' tota^  arrival  rate. 

P1  + °2  + ’ * * + °n  ' utilization  factor. 

p1+  P2  + •••  + , partial  sum  of  utilization. 

X/p  , averaged  total  service  rate. 
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Unless  otnerwise  stated,  we  will  assume  p < 1.  We  will 
confine  out  discussion  to  scheduling  of  single  server 
queueing  systems  with  Poisson  arrivals  and  general  service 
time  distributions.  The  class  of  scheduling  that  we  will 
study  is  called  "work-conserving",  which  is  defined  as  the 
following  [KLEI65J : 

DEFINITION:  A scheduling  discipline  is  work-conserving  if 

it  has  the  following  properties: 

(i)  The  processor  is  always  kept  busy  as  long  as 
there  are  jobs  in  the  system  to  be  processed. 

(ii)  The  total  processing  time  required  by  any 
job  is  not  affected  by  the  scheduling  decision. 

A scheduling  is  called  nonpreemptive  if  once  a job 
starts  its  service,  it  will  be  served  till  completion.  A 
scneuuling  is  preemptive  if  it  allows  interruption  and  gives 
service  to  another  job  with  higher  priority.  We  will  limit 
our  discussion  to  nonpreemptive  scheduling  rules  that  are 
work-conserving . 

Below  we  summarize  some  results  which  are  relevant  to 
our  work.  First,  we  have  the  following  theorem  due  to 
Kleinrock  (KLEIbSj  (see  also  [SCHR7U ] and  [HOAR72]). 

THEOREM  1.1.  In  multiclass  M/G/l  queueing  systems  under  all 

nonpreemptive  work-conserving  scheduling 

n 

disciplines,  the  quantity  z p E[W,],  which  is  a 

k=l 
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weighted  sum  of  mean  waiting  times  for  all 
classes,  is  an  invariant,  and  it  is  equal  to 
n , 

p(  z \.EIS‘J)  / 2 ( 1-  p)  . 
k=l  K K 

Now,  if  we  let 

n 2 

(1.3.1)  W = ( £ \.E[S^])  / 2 (1- p)  , 

k=l  K 

then  the  above  theorem  becomes: 

(1.3.2)  PJE[W1]  + p2e[w2]  + ...  + enE[wn]  = pW. 

It  is  noted  that  W depends  only  on  the  first  and  second 
moments  of  the  service  time  variable,  rather  than  on  the  whole 
service  time  distribution. 

The  relation  (1.3.2)  is  generally  refered  to  as  the 
conservation  law  of  mean  waiting  times.  This  conservation 
law  puts  a linear  equality  constraint  on  the  set  of  mean 
waiting  times  {E [Wi] } . Any  attempt  to  modify  the  queueing 
discipline  so  as  to  reduce  one  of  the  E[Wi] 's  will  force  an 
increase  in  some  other  E[Wj].  However,  this  need  not  be  an 
"even  trade"  since  the  weighting  factors  may  be  different. 

Given  n classes  of  arrivals,  if  we  disregard  their 
priority  ana  serve  them  strictly  on  a f irst-come-f irst-serve 
( FCFS ) basis,  we  have  the  following  Pollaczek-Khinchin 
formula  for  mean  waiting  times  (See,  e.g.,  IWOLF70]): 
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THEOREM  1.2.  In  M/G/l  queueing  systems  with  multiclass 


arrivals,  under  FCFS  scheduling,  the  mean 
waiting  times  for  all  classes  are  all  equal  to 
W.  That  is,  we  have 

U-3.3)  E^W1^FCFS  = E^W2^FCFS  = * = E ^Wn^  FCFS  = W’ 

We  will  interchangably  use  E^WpCFS^  and  w to  represent 
the  same  quantity  as  defined  in  (1.3.1). 

Among  other  priority  queueing  disciplines,  the 
nonpreemptive  head-of-the-line  (HOL)  scheduling  studied  by 
Cobham  [COBH54]  is  the  scheduling  rule  most  commonly  known. 
This  discipline  is  also  known  as  fixed  priority  scheduling 
or  strict  priority  scheduling.  We  will  call  it  by  the  name 
"fixed  priority  scheduling." 

Under  this  scheduling  rule,  jobs  are  grouped  into 
different  priority  classes.  Upon  arrival  of  a job,  it  joins 
the  queue  for  the  priority  class  that  it  belongs  and  becomes 
the  last  job  in  the  queue.  When  the  service  facility  is 
available,  the  scheduler  searches,  starting  from  the  highest 
priority  class  and  moves  downward  to  lower  classes,  to  find 
the  first  nonempty  queue.  The  first  job  from  this  queue  is 
then  taken  for  service.  The  scheouling  mechanism  is 
illustrated  in  Figure  1.1.  We  note  that  within  the  same 
priority  class,  joos  will  be  selected  on  a f irst-come-f irst- 
served  (FCFS)  basis.  The  convention  of  denoting  higher 
priority  class  with  smaller  class  indices  will  be  followed. 
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Figure  1.1  Fixed  Priority  Scheduling  Mechanism 

The  mean  waiting  time  for  each  priority  class  under 
this  scheduling  rule  is  given  by  the  following  theorem 
[COBH54 ] : 

THEOREM  1.3.  In  multiclass  M/G/l  queueing  systems  under 
fixed  priority  scheduling,  the  mean  waiting  time 
for  each  priority  class  is  given  by 

(1.3.4)  ElWilpp  = (1-P)  W/  ( 1 ~ o' i_j_)  (1-CT±)  - d<i<n) 

As  a corollary,  we  have  the  following: 

(1.3.5)  (1-P)  W / (1-PL)  - ElWL]pp  < W, 
and 


(1.3.6) 


W < ElWn]  __  = W / (1-P+Pj  . 


The  following  theorem  addresses  the  problem  of  optimal 
ordering  of  priority  classes.  This  result,  generally  known 
as  "the  tic  rule  for  Poisson  arrivals",  can  be  established 
by  an  interchange  argument  using  the  conservation  equation 
(1.3.2)  (See,  [BROS63]). 


THEOREM  1.4.  In  multiclass  M/G/l  queueing  systems  under 
fixed  priority  scheduling,  in  order  to  minimize 
n 

(1.3.  / ) C = £ i^c  k E [ w k]  , 

k=  1 

priority  ordering  should  be  assigned  in  the 
descending  order  cf  w- ic  i*  That  is,  higher 
priorities  are  given  to  classes  with  larger 
values  of 

In  the  aoove  theorem,  c^  can  be  thought  of  as  the  cost 
of  unit  time  of  waiting  per  job  from  class  i.  In 
particular,  if  we  let  c / = then  c V becomes  the  total 

cost  of  waiting  per  unit  time  for  class  i arrivals.  In  this 
case,  we  have  the  following  corollary. 

COROLLARY : In  order  to  minimize 

n 

(1.3.8)  C'  = £ c£E[Wk]  , 

k — 1 

priority  ordering  should  be  assigned  in 
descending  order  of 
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Next,  we  define  the  rule  we  call  "escalating  priority 
scheduling".  This  rule  was  first  proposed  and  studied  by 
Kleinrock  [KLEIt>4]. 

DEFINITION.  In  escalating  priority  scheduling,  there  is  a 
parameter  («^>0)  associated  with  each 

priority  class  i.  When  a job  from  class  i 

arrives  to  the  system,  its  priority  index 
"escalates"  linearly,  starting  from  0,  with  rate 
Whenever  the  service  facility  becomes 
available,  the  job  with  the  highest 

instantaneous  priority  index  is  then  selected 
for  service.  A tie  is  broken  by  any  arbitration 
rule . 

Theoretically,  the  arbitration  rule  for  resolving  two 
jobs  with  the  same  priority  indices  is  unimportant,  since 
the  probability  of  this  event's  happening  is  zero. 
Practically,  however,  we  can  either  give  preference  to  the 
job  which  has  waited  longer  (i.e.,  following  FCFS  rule),  or 
to  the  job  belonging  to  a higher  priority  class  but  has 
waited  for  a shorter  time  (i.e.,  according  to  the 
priorities) . 

Under  this  scheduling,  the  priority  function  for  a job 
from  class  i which  arrives  to  the  system  at  time  is, 


19 


q^t)  = a±  ( t - t±  ) 

where  t ranges  from  until  this  job  obtains  service. 
Thus,  priority  of  each  job  changes  dynamically.  The  longer 
a job  waits,  the  higher  its  priority  becomes.  Taking 
waiting  time  into  consideration,  this  scheduling  rule  gives 
priority  to  jobs  that  have  been  in  the  queue  for  a long 
time,  even  though  they  may  belong  to  low  priority  classes. 


Figure  1.2  illustrates  the  priority  functions  of  two 
jobs  from  different  priority  classes  which  enter  the  system 
at  two  different  times.  Specifically,  at  time  , a job 
from  priority  class  i arrives,  and  attains  priority  at  a 
rate  equal  to  a^.  At  a later  time  t , another  job  from 
priority  class  j (j<i)  enters  the  system,  and  attains  its 
priority  at  a higher  rate  cty  The  priority  of  this  job  from 
class  j will  catch  up  with  that  of  the  job  from  class  i at 
time  t *.  Should  the  server  become  available  any  time  before 
t *,  the  job  from  class  i will  obtain  service  in  preference 
to  the  job  from  class  j,  despite  that  it  is  from  a "lower" 
priority  class  - a class  with  "lower"  priority  increasing 
rate.  On  the  other  hand,  if  the  service  facility  is 
occupied  until  after  t *,  then,  when  the  server  is  available, 
the  job  from  class  j will  be  chosen. 


We  note  that  under  the  rule  of  escalating  priority 
scheduling,  preferential  treatment  is  given  to  classes  with 
larger  values  of  the  parameter  a^:  For  two  jobs  arriving  to 
the  system  at  the  same  time,  the  one  with  larger  value  of  a^ 
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gains  priority  at  a faster  rate,  hence  will  be  served 
earlier  . 


Figure  1.2  Interaction  between  Priority  Indices 
in  Escalating  Priority  Scheduling 

The  following  theorem,  due  to  Kleinrock  [KLEI64] , gives 
the  behavior  of  mean  waiting  time  of  each  priority  class 
under  escalating  priority  scheduling. 

THEOREM  1.5.  In  multiclass  M/G/l  queueing  systems  under 
escalating  priority  scheduling  and 

al  2.  a2  2 •••  2 °n'  the  following  set  of  n 
linear  equations  hold:  For  each  i,  l£i<n, 

(1.3.9)  {1-2  (1-  p.  }E  [Wi]  + £ (1-— )p,E[W.]  =W. 

k=l  ak  k k=i+l  °i  k k 

The  mean  waiting  times  for  each  priority  class  i 
can  thus  be  obtained  recursively  (from  n 
backwards  to  1)  as 
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(1.3.10) 


E[Wi]  = 


W-  L (l--^)p  E(W  ] 
k=i+ 1 ai  K K 
i—  1 # . 

1-  E (l-~ )PU 


Note  that  we  have  used  the  convention  to  interpret  a 
null  summation  as  0 in  equations  (1.3.9)  ana  (1.3.10). 


The  aependence  ol  E[Wi)  on  the  set  of  parameters  { o^} 
in  (1.3.10)  shows  tnat  the  relative  mean  waiting  times  of 
different  priority  classes  can  be  adjusted.  This  makes 
escalating  priority  scheduling  more  attractive  than  the 
conventional  fixed  priority  scheduling,  since  it  is  possible 
to  change  the  values  of  the  parameters  for  a more  desirable 
system  performance. 


We  remark  that  since  this  nonpreemptive  scheduling  is 
work-conserving,  the  conservation  law  requires  a linear 
constraint  (1.3.2)  on  the  performance  of  mean  waiting  times 
of  all  priority  classes. 


When  Kleinrock  first  investigated  this  scheduling  rule, 
he  proved  the  above  theorem  only  for  the  case  when  service 
time  distributions  are  all  exponential.  In  his  entire  proof 
of  this  theorem,  only  the  mean  of  waiting  time  distribution 
is  used.  Since  this  mean  waiting  time  is  only  a function  of 
tne  first  and  second  moments  of  the  service  time 
distributions,  and  this  is  true  also  for  the  M/G/l  system  as 
given  by  equation  (1.3.2),  the  same  theorem  must  also  hold 


for  M/G/l  system  by  using  the  values  of  W given  by  (1.3.2). 
When  Kleinrock  summarized  this  result  later  in  his  book 
[KLEI 76 ] , he  seemed  to  imply  this,  although  he  did  not 
clearly  make  the  point.  (He  had  kept  the  sentence  "we  study 
this  system  for  the  case  of  exponential  service  times  ..." 
in  his  book.) 

1 . 4 Problem  Definition  and  Overview 

As  we  stated  earlier,  in  transaction  processing 
systems,  priority  scheduling  is  required  in  order  to  give 
different  response  times  to  different  types  of  transactions. 
In  many  existing  TPS,  fixed  priority  scheduling  is 
implemented.  When  these  systems  are  operating  under  heavy 
congestion,  many  low  priority  transactions  suffer 
excessively  long  response  times.  However,  it  is  possible 
that  some  high  priority  transactions  are  still  obtaining 
unnecessarily  fast  responses.  This  suggests  that  a 
scheduling  rule  other  than  fixed  priority  scheduling  is 
desirable . 

Studies  of  human  behavior  in  man-machine  interactions 
have  shown  that  tong  response  times  will  interrupt  the 
continuity  of  human  thinking,  and  will  drasically  reduce  the 
productivity  of  the  terminal  users  (See,  [MILL68]  and 
[CARB68 ] ) . In  general,  it  is  considered  that  for  response 
times,  the  faster,  the  better.  However,  we  all  know  that 
there  is  also  a human  reaction  limit.  A response  time  that 
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is  faster  than  what  a human  being  can  react  is  not 
necessary.  Figure  1.3  shows  the  situation  of  response  times 
under  fixed  priority  scheduling,  together  with  a desirable 
performance  that  lies  within  the  two  human  response  limits. 


Figure  1.1  Response  Times  Behavior  under 
Fixed  Priority  Scheduling  Rule 

The  above  discussion  suggests  that  it  is  desirable  to 
have  a scheduling  rule  to  provide  reasonable  response  times 
to  low  priority  transactions  so  that  the  continuity  of  human 
thinking  will  not  be  interrupted.  The  escalating  priority 
scheduling  rule  appears  to  be  a good  scheduling  rule  to 
consider  because  it  can  discriminate  against  high  priority 
transactions  in  not  providing  faster  response  time  than 
necessary  and  thus  allow  some  of  the  TPS  resources  to 
respond  to  lower  priority  transactions. 
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In  analyzing  queueing  models,  it  is  generally  desirable 
to  obtain  measures  other  than  just  mean  waiting  times.  For 
analyzing  multiclass  M/G/l  queueing  models,  in  order  to 
obtain  the  waiting  time  distributions,  specific  service 
times  distributions  must  generally  be  used  and  even  then,  it 
is  usually  difficult  to  obtain  analytic  results.  In 
studying  escalating  priority  scheduling  for  TPS,  we  will 
limit  our  investigations  to  the  mean  and  variance  of  waiting 
times . 

Some  of  the  questions  that  need  to  be  answered  in 
determining  if  escalating  priority  scheduling  is  indeed 
applicable  to  transaction  processing  systems  are: 

(1)  What  is  the  feasible  performance  space  of  mean 
waiting  times  under  this  scheduling?  What  are  the  bounds  of 
mean  waiting  time  for  each  priority  class? 

(2)  How  do  we  find  the  values  of  the  control  parameters 
that  will  optimize  different  cost  objective  functions? 

We  will  answer  these  questions,  plus  investigating  some 
other  aspects  of  escalating  priority  scheduling,  in  this 
dissertation.  In  Chapter  2,  we  investigate  the  limitations 
of  this  scheduling,  derive  the  feasible  performance  space  of 
mean  waiting  times  ana  show  that  the  range  of  this 
scheduling  rule  is  from  fixed  priority  scheduling  to 
f irst-come-f irst-serve  scheduling  with  no  priorities. 


25 


In  Chapter  3,  we  study  the  problem  of  optimization  for 
several  cost  functions.  Our  solution  procedure  is  divided 
into  two  stages:  First,  we  determine  a set  of  optimal  mean 
waiting  times  for  a given  cost  function.  Then,  we  determine 
the  values  of  control  parameters  for  this  set  of  mean 
waiting  times. 

Chapter  4 discusses  several  other  aspects  of  escalating 
priority  scheduling,  such  as  bounds  on  the  ratios  of  mean 
waiting  times,  the  behavior  of  these  ratios  of  waiting  times 
under  saturated  congestion  and  variances  of  waiting  times 
under  different  levels  of  discrimination. 

In  Chapter  5,  we  compare  the  behavior  of  escalating 
priority  scheduling  with  other  "adjustable  priority 
scheduling  rules",  discuss  the  use  of  escalating  priority 
scheduling  in  transaction  processing  systems,  and  give 
conclusions  and  directions  for  future  research. 
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CHAPTER  2 


ESCALATING  PRIORITY  SCHEDULING  DISCIPLINE 

2 . 0 Introduction 

We  have  seen  in  Chapter  1 that  escalating  priority 
scheduling  has  a set  of  control  parameters.  By  adjusting 
the  settings  of  these  parameters,  different  mean  waiting 
times  can  be  achieved.  In  this  chapter,  we  study  the 
limitations  of  this  scheduling,  derive  the  feasible 
performance  space  of  mean  waiting  times  for  each  class  of 
jobs,  ana  aevelop  an  algorithm  for  determining  values  of  the 
control  parameters  given  a set  of  feasible  expected  waiting 
times . 

In  Section  2.1,  we  introduce  a general  concept  of 
scheduling  rules  with  adjustable  parameters.  For  this  class 
of  scheduling  rules,  we  derive  a set  of  constraints  (bounds) 
on  the  mean  waiting  times  of  jobs  from  different  priority 
classes  ana  show  that  they  can  cover  the  spectrum  of 
scheduling  from  no  discrimination  to  maximum  discrimination 
among  priority  classes  by  adjusting  their  parameters. 

In  Section  2.2,  we  show  that  the  set  of  constraints 
derived  in  Section  2.1  is  sufficient  for  characterizing  the 
feasible  performance  space  of  mean  waiting  times  under 
escalating  priority  scheduling.  That  is,  any  combination  of 
mean  waiting  times  satisfying  this  set  of  constraints  can  be 
achieved  by  escalating  priority  scheduling.  We  then  develop 
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an  algorithm  for  determining  the  values  of  the  control 
parameters  given  any  set  of  feasible  mean  waiting  times. 

In  section  2.3,  we  discuss  interpretation  of  boundary 
points  of  the  feasible  performance  space  under  escalating 
priority  scheduling,  ana  show  that  non-extreme  points  on  the 
bounaary  leaa  to  a natural  definition  of  a mixed  rule  of 
escalating  priority  and  fixed  priority  scheduling. 

2 . 1 Adjustaple  Priority  Scheduling  Rules 

In  multiclass  queueing  systems,  priority  scheduling  is 
used  to  discriminate  among  different  classes  of  arrivals. 
Some  of  tne  scheduling  rules  are  furnished  with  a set  of 
parameters  that  can  be  adjusted  to  provide  different 
discriminations  among  the  different  classes  of  jobs.  We 
call  this  class  of  scheduling  rules  the  "adjustable  priority 
scheduling  rules",  and  define  it  formally  as  the  following: 

DEFINITION.  In  an  adjustable  priority  scheduling  rule, 
there  is  a set  of  parameters  that  can  be  varied 
to  enforce  different  degrees  of  discriminations 
among  priority  classes. 

In  adjustable  priority  scheduling  rules,  the  priority 
index  of  a job  must  be  a function  of  more  than  the  parameter 
associated  with  the  priority  class,  otherwise  the  scheduling 
rule  will  become  fixed  priority  scheduling  without  any 
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index 


function 


adjustability.  Usually,  the  priority 
depends,  in  addition  to  the  parameter  associated  with  the 
priority  class,  on  the  history  or  the  current  state  of  the 
system.  For  example,  in  escalating  priority  scheduling,  the 
priority  inoex  of  a job  depends  on  both  the  waiting  time  and 
the  parameter  associated  with  the  job.  Other  examples 
include  Earliest  Due  Date  scheduling  and  deescalating 
priority  scheduling  (these  scheduling  rules  are  discussed  in 
Chapter  5.) 

In  the  following,  we  investigate  the  limitations  of 
mean  waiting  times  under  any  adjustable  priority  scheduling 
rule.  Please  recall  that  we  have  restricted  ourselves  to 
scheduling  rules  that  are  both  work-conserving  and 
nonpreemptive . First,  we  have  the  following  theorem. 

THEOREM  2.1.  In  multiclass  M/G/l  queueing  systems,  under 
any  nonpreemptive  adjustable  priority  scheduling 
rule,  the  mean  waiting  times  for  each  class  is 
bounded  by: 

(2.1.1)  (1- p)  W/  (1- p £ £E[Wi]  £W/(l-p+oi).  (l<i<n) 

Proof : For  each  priority  class  i,  the  maximum  preference 

tnat  can  be  given  to  this  class  under  any  adjustable  priority 
scheduling  rule  is  when  jobs  from  this  class  are  given 
utmost  preference  over  any  other  classes'  jobs.  Under  this 
condition,  no  job  from  any  other  class  will  ever  be  selected 


for  service  as  long  as  there  are  jobs  from  class  i present 
in  the  system.  But  this  is  exactly  what  happens  when  the 
system  is  operated  under  fixed  priority  scheduling  that 
gives  class  i the  highest  prioity.  Thus,  the  mean  waiting 
time  of  class  i when  it  is  set  as  the  highest  priority  class 
under  fixed  priority  scheduling  gives  the  lower  bound  of 
E[w/i]  for  any  adjustable  priority  scheduling  rule.  From 
(1.3.5)  of  Theorem  1.3,  we  have  the  first  half  of  inequality 
set  (2.1.1) . 

Similarly,  the  maximum  discrimination  that  can  be 
applied  against  class  i under  any  adjustable  priority 
scheduling  rule  is  when  jobs  from  this  class  are  given  least 
preference.  As  long  as  there  are  jobs  from  other  classes 
present  in  the  system,  no  job  from  class  i will  be  selected 
for  service.  This  is  what  happens  in  fixed  priority 
scheduling,  when  class  i is  set  as  the  lowest  piiority 
class.  Therefore,  the  upper  bound  of  E(Wi]  can  be  obtained 
from  (1.3.5)  of  Theorem  1.3,  which  is  the  second  half  of 
inequality  set  (2.1.1). 

In  the  following,  we  show  that  not  only  is  the  mean 
waiting  time  of  each  priority  class  bounded,  but  a weighted 
sum  of  the  mean  waiting  times  for  any  number  of  classes  is 


also  bounded. 


nonpreemptive  adjustable  priority  scheduling 
rule,  for  any  set  I of  priority  class  indices, 
we  have  the  following  bounds  on  a weighted  sum 
of  mean  waiting  times  for  jobs  with  class 
indices  in  this  set: 


(2.1.2)  (1-p)  ( E p.  )W/(1-  I p.  ) < £ P > E [ W,  ] < ( £ p )W/(1-P+  £ p.  ) . 

k€l  * k€l  * k€ I kfcl  K k€i 


In  the  above  expression,  the  notation  ^ denotes  the 

k€l 

summation  for  all  classes  witn  class  index  in  I.  We  will 
exclude  the  case  when  I is  equal  to  the  set  of  all  priority 
indices,  because  in  this  case,  (2.1.2)  becomes  the 
conservation  constraint  (1.3.2). 


Proof : For  any  given  I as  the  set  of  class  indices  of  an 

arbitrary  group  of  priority  classes,  let  the  mean  waiting 
time  of  all  jobs  in  this  group  be  ElW^].  Let  N denote  the 
set  of  all  class  indices.  Tnat  is,  N = {1,  2,  ...,  n}.  Let 
N-I  aenote  the  set  of  class  indices  which  are  not  in  I. 
Then  if  we  look  at  the  classes  with  class  indices  in  I as 
distinct  classes,  from  the  conservation  law  we  nave 

Z p ElW . ] + Z PlEIW.  ] = pW. 
k€l  * K k€N-I 

Now,  if  we  consider  the  whole  group  of  priority  classes 

with  class  indices  in  I as  a single  class,  its  congestion 

factor  is  then  equal  to  Z pv.  From  this  point  of  view,  the 

k€l 
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conservation  equation  becomes 


( E pj  E[W_]  + E oE[Wv  ] * o W. 
kti  K 1 k€N-I 


From  the  above  two  equations,  we  have 


(2.1.3)  E[W_]  = E p E[W.  ] / E p 

1 kfcl  K K k€l 


Now,  for  tne  mean  waiting  time  ElWj]  of  all  jobs  in 
this  "combined  class",  under  any  adjustable  priority 
scheduling  rule,  from  Theorem  2.1,  we  have 

(l-p)W/(l-  E pj  < E(Wt]  < W/  (1-0  + E p ). 

k6i  K 1 k€l 

Substituting  (2.1.3)  into  the  above  expression,  and 

multiplying  by  the  quantity  ( 2 p ) throughout,  we  obtain 

k€l  k 

(2.1.2)  . 

We  remark  that  in  Theorem  2.2,  for  the  special  case 
when  the  set  I consists  of  only  one  element  (e.g.,  I = { i } ) , 
then  (2.1.2)  becomes 

(1- p)  p±W/  (1- pi)  _<  p ±E  [ W i ] _<  Pi  W/  (1- p+p  i)  . 

Cancelling  out  the  factor  p^,  this  becomes  an  inequality  of 

(2.1.1) .  Thus,  if  we  allow  I to  be  any  set  of  class 
indices,  then  (2.1.2)  includes  (2.1.1)  as  a special  case. 

We  will  now  examine  the  two  sets  of  inequalities  in 

(2.1.2) ,  and  show  that  when  the  conservation  equation 

(1.3.2)  holds,  these  two  sets  of  inequalities  can  be  derived 
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from  each  other: 

From  tne  first  set  of  inequalities,  for  each  I,  we  have 
E P [ W,  ) > (1-p)  W ( E P,)/(1-  I ok)  • 

kei  k k “ k€i  k kei 

From  the  conservation  law,  we  have 
(1.3.2)  PjEtWjJ  + P2E[W2]  + ...  + PnE[Wn]  = P W, 

Taking  the  difference  of  the  above  two  equations,  we 
then  have 

E p.EfW,]  < pW-  ( 1—  p ) W ( E P,)/(1-  S Pk) 
k€N-I  k k£l  k€l 

= (pW-  ( £ o.  )pW-  ( E p.)W+  ( E p )pWj/(l-  E P,) 

ktN-I  k k€l  k k6l  K k€I  K 

= ( E p.)W/  (1-p  + T.  Pk)  • 
k€N-I  ktN-I 


This  is  an  inequality  in  the  second  set  of  (2.1.2).  Since 
this  is  true  for  all  i,  all  inequalities  in  the  second  set 
of  (2.1.2)  can  be  derived  from  the  first  set  of 
inequalities . 

Similarly,  if  we  take  the  difference  of  (1.3.2)  with 
any  inequality  in  the  second  half  of  (2.1.2),  we  obtain  an 
inequality  in  the  first  half  of  (2.1.2).  This  holds  true 


p. 


w 


Since  the  conservation  law  holds  for  any  nonpreemptive 
adjustable  priority  scheduling,  only  one  set  of  inequalities 
in  (2.1.2)  is  needed  to  characterize  the  mean  waiting  times 
behavior . 

Summarizing  the  above  discussion,  we  have  the  following 
necessary  conaitions  for  any  nonpreemptive  adjustable 
priority  scheduling  rule: 

(1.3.2)  p1E[W1J  + p2EtW2)  + ...  + PnE[Wn]  = p W, 
and 

(2.1.4)  VlcN  z Pv  E (W,  ) > (1-0)  ( I p,)W/(l-  I Pk)  . 

In  the  above  expression,  we  have  used  the  notation  VlcN 
to  mean  "for  each  proper  subset  I of  N"  , where  a proper 
subset  of  N is  defined  to  be  a nonempty  subset  of  N that  is 
not  equal  to  N itself  [LIPS64],  Thus,  for  example,  when  N = 
{1,  2,  3},  then  I can  be  any  of  tne  following:  {1},  {2}, 
{3},  (1,  2},  {1,  3}  and  (2,  3}.  Therefore,  when  we  require 
I to  run  through  all  the  possible  subsets  of  N,  (2.1.4) 
becomes  a set  of  six  inequalities.  We  remark  that  for  a set 
N of  n elements,  (2.1.4)  consists  of  2n-2  inequalities. 

The  above  summary  states  that  for  any  setting  of  an 
adjustable  priority  scheduling  rule,  the  point  W = 
(E [W^J , E [W2 ],..., E [ W nl ) must  lie  within  the  space  defined  by 
(1.3.2)  and  (2.1.4).  In  other  words,  the  space  defined  by 
(1.3.2)  and  (2.1.4)  is  the  maximum  possible  performance 


space  of  mean  waiting  times  under  any  adjustable  priority 
scheduling  rule. 

We  now  give  the  geometrical  representations  of  these 
constraints  in  two  and  three  dimensional  spaces. 

Figure  2.1  shows  the  bounds  of  mean  waiting  times  in  a 
two-class  queueing  system  under  any  adjustable  priority 
scheduling  rule.  The  conservation  equation  (1.3.2)  in  the 
two-dimensional  space  is  a straight  line  L: 
P1E^W1^  + p ^ l W 2 1 = pW.  The  set  of  constraints  (2.1.4) 
consists  of  two  inequalities: 

(a)  > (1- p)  p-j^W/  (1-  p ; 
and  (b)  p2E[W2]  > ( 1 - p ) p 2 W/  ( 1 - p 2)  . 

Let  L j:  p E [ W -^ ) = (1- p)  P-^W/  (1- p j)  and  L2:  P2E[W2]  = 
(1- p) p2W/ (1- p ^ correspond  to  the  boundaries  of  the  above 
two  inequalities.  Then  we  see  that  L and  L intercept  at 
U 12  = ( (1- p)  W/ (1- p j)  , W/(l-pj));  while  L2  and  L intercept 
at  021  = (W/(l-p^,  (1-P)W/(1-  p,))  . 

Thus,  the  maximum  performance  space  of  mean  waiting 
times  is  the  line  segment  0^2^21'  ^-s  interesti-n9  to  note 
that  point  P = (W,  W)  lies  on  the  line  Qi2^21*  This  point 
corresponds  to  a scheduling  rule  which  applies  no 
discrimination  among  the  two  priority  classes  (e.g.,  FCFS  or 
last-come-f irst-serve  (LCFS)  scheduling) . 


Figure  2.1  Bounds  of  Mean  Waiting  Times 
in  a Two-Class  Queueing  Model 

We  note  that  for  points  in  between  P and  Q^2'  we  have 
E[W^]  < £ [ W7 ) . Tnus,  they  correspond  to  scheduling  rules 
that  give  preference  to  class  1.  Similarly,  points  lying 
between  P and  Q correspond  to  scheduling  rules  that  give 
preference  to  class  2.  The  extreme  point  Qj2  corresponds  to 
the  mean  waiting  times  performance  of  fixed  priority 
scheduling  which  gives  high  priority  to  class  1.  Similarly, 
point  Q £ ^ corresponds  to  mean  waiting  times  performance  of 
fixed  priority  scheduling  with  reversed  ordering  of 
pr ior ities . 

Next,  we  will  discuss  3-class  queueing  systems.  In 
3-dimensional  space,  the  conservation  equation  (1.3.2) 
Decomes  a plane  S:  p^ElWjJ  + P2E[W2]  + p n E [ ] =pW.  It  is 
snown  as  the  plane  passing  through  the  points  XI,  X2,  X3  in 
Figure  2.2. 


In  this  case,  constraints  (2.1.4)  is  the  following  set 
of  2^-2  or  6 inequalities: 

(1)  p1£[W1]  > (1-p)  PlW/(l-Pl) 

U)  p2EIW2]  > (1-p)  p2w7(l-p2) 

(3)  p3e[w31  > (1-p)  p3w/(i-p3) 

(4)  Px  E [Wx]  + p2E[W2J  > (1-p)  (P1  + P2)W/(l-p1-p2) 

(5)  Px  E IW1 J + p3E[W3]  > (1-p)  (Pl  + p3)W/(l-Pl-p3) 

(b)  P2  ELW2]  + P 3 E tw3]  > (1-p)  (p2  + p3)W/(l-p2-p3) 


Figure  2.2  Bounds  of  Mean  Waiting  Times 

in  a Three-Class  Queueing  Model 


The  six  lines  corresponding  to  the  boundaries  of  the 
above  six  inequalities  are  shown  as  through  in  Figure 
2.2.  The  six  vertices  Q123>  Q213,  Q231,  <i321.  Qn2  and  Q132 
correspond  to  the  mean  waiting  times  performances  of  the  six 
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different  orderings  under  fixed  priority  scheduling.  The 
subscript  of  each  point  represents  the  ordering  of  priority 
classes.  Thus,  for  example,  Q213  represents  the  mean 
waiting  times  performance  of  fixed  priority  scheduling  when 
class  2 is  given  highest  priority,  class  1 the  next,  and 
class  3 the  lowest  priority.  Once  again,  the  point  P = (W, 
W,  W)  on  the  plane  corresponds  to  scheduling  with  no 
discriminations  among  the  three  priority  classes. 

From  the  above  two  examples,  we  see  that  the  space 
defined  by  (1.3.2)  and  (2.1.4)  is  a convex  polyhedron.  It 
can  be  thought  of  as  a region  on  the  "hyperplane"  defined  by 
the  conservation  equation  (1.3.2)  that  is  c ir cumf erenced  by 
the  set  of  linear  inequalities  (2.1.4). 

We  note  that  the  point  P = (W,  W,  . . . , W)  always  lies 
in  this  space  because  it  satisfies  both  (1.3.2)  and  (2.1.4). 
This  point  corresponds  to  mean  waiting  times  performance  of 
scheduling  rules  that  have  no  discrimination  (FCFS,  LCFS,  or 
other  rules  that  do  not  apply  discrimination  among  priority 
classes) . 

We  also  note  that  from  the  proof  of  Theorem  2.1,  the 
bounds  of  adjustable  priority  schedulings  are  obtained  from 
the  mean  waiting  times  of  fixed  priority  scheduling,  which 
can  be  achieved  by  using  either  FCFS  or  LCFS  to  determine 
the  ordering  of  jobs  within  the  same  class. 
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Therefore,  this  class  of  adjustable  priority  scheduling 
rules  covers  the  spectrum  of  scheduling  rules  which  do  not 
distinguish  among  priority  classes  to  those  that  separate 
priority  classes  to  the  maximum  extent. 

2 . 2 Feasible  Performance  Space  of  Mean  Waiting  Times 

In  Section  2.1,  we  showed  that  mean  waiting  times 
performance  of  any  adjustable  priority  scheduling  must 
satisfy  the  following  constraints: 

(1.3.2)  P1EtW1]  + P2E[W2]  + ...  + °nE[Wn]  = PW, 

and 

(2.1.4)  VICN  I ovE  IW^.]  > (1-P)(  I P„)W/(1-  T PV). 

k€I  k€l  k6l 

Since  these  constraints  are  linear,  the  region  bounded 
by  these  constraints  is  a convex  polyhedron.  The  interior 
points  of  this  space  are  defined  by  (1.3.2)  and 

(2.2.1)  YIcN  r p,  E [W.  ] > (l-p)(  E p.)W/(l-  E p.)  , 

k€l  * k€I  k€i 

where  (2.2.1)  is  obtained  by  replacing  with  ">"  in 

(2.1.4)  . 

In  this  section,  we  show  that  we  can  find  a set  of 
values  for  the  control  parameters  { ai } in  escalating 
priority  scheduling  for  any  set  of  mean  waiting  times 
satisfying  (1.3.2)  and  (2.2.1).  Thus,  (1.3.2)  and  (2.2.1) 
characterize  the  feasible  performance  space  of  mean  waiting 
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times  under  escalating  priority  scheduling.  In  addition,  we 


present 

an  algorithm  for 

determining 

the  values 

of  { 

Qi  1 

given  a 

set  of  feasible  i 

Tiean  waiting 

times . 

Before  we  proceed 

to  derive 

the  procedure 

for 

determining  values  of 

the  control 

parameters 

{ } 

, we 

examine 

Theorem  1.5  more 

thoroughly . 

We  recall 

that 

in 

Theorem 

1.5,  equations 

(1.3.9)  and 

(1.3.10)  were 

obtained 

under 

the  condition 

that  o :> 

01 2—  •••  > *n  • 

This 

corresponds  to  the  setting  of  orderings  that  gives  class  1 
tne  highest  priority,  class  2 the  second,  and  so  on,  and 
class  n the  lowest  priority.  Under  this  setting  of 
priorities,  we  have  the  following: 


THEOREM  2.3.  In  multiclass  M/G/l  queueing  systems  under 
escalating  priority  scheduling,  if  we  are  given 
^ — a2—  * * * — / then 

(2.2.2)  E[W1)  < E[W2]  < ...  < E [ Wn  ] . 

Proof  : we  shall  show  that 

E[WjJ  < + for  all  i,l  < i<n-l  . 

From  (1.3.9),  we  have 


E[W.] 


- n 

W-  L (l--^)p ,E[W  ] 
k=i+ 1 Q i K K 


i-i 

1-  Z (i  “ )°k 

k=l  ai  K 
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r 


w-  2 (1  E[W,  ) 

k=i+2  ai  * 2_ 

i-1  aL 

1-  E (1  “ ~~)  Pk 
k=l  ak 


W-  E (1 )p  E[W  ] 

k-i+2  a i+  1 k k 
i—  1 a . , 

1-  E d--^)Pk 
k=l  ak  K 


W-  E (1 )p  E[W  ] 

k=i+2  ^i+l  _k 

i a , 

1-  E U--^)Pk 
k=l  “k  K 


- E[wi,-il 


Thus,  (2.2.2)  holds. 


14 


In  the  following  theorem,  we  describe  an  algorithm  for 
determining  the  values  of  { a^t  given  a set  of  mean  waiting 
times  (E  l W^J , Elw2l,  ...,  E(Wnl)  satisfying  (1.3.2),  (2.2.1) 
ana  (2.2.2) . 

THEOREM  2.4.  In  a multiclass  M/G/l  queueing  system  with 
P < 1,  a set  of  values  for  { can  be  obtained 

for  escalating  priority  scheduling  to  achieve 
any  given  set  of  mean  waiting  times  (E[W^], 
E [ W.  J , ...,  E l W nJ  ) that  satisfies  (1.3.2), 

(2.2.1)  and  (2.2.2).  Define,  recursively  (for  i 
from  n-1  backwards  to  1) 


(2.2.3) 

a . = (r . a.  . +p  . ) E iw  . 1 /E (W . . ] , 
i li+l  i i i-l 

(2.2.4) 

n 

bi=  (W-  Z okE[W,l  )/E[Wi_1)  - (l-ai_1), 
k=i 

(2.2.5) 

c.  = p . -b.  . - r . a ., 
i l l+l  ii+l' 

where  r^  is  the  unique  positive  root  of 

(2.2.6) 

ai+lX  + bi+lX  + ci+l  = 0 

ana,  initially, 

(2.2.7) 

an  * »„MWn]/E 

(2.2.8) 

bn  - <S-onE [Wnj )/E [W„_L]  - (l'»n_l>. 

(2.2.y) 

cn  ■ 

Then,  { } can  be  obtained  as: 

n-1 

(2.2.10)  »i  = 1 / n tk  i = 1,  2,  . . . , n. 

k=i 
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Note  that  in  (2.2.1U),  we  have  used  the  convention  to 
interpret  a null  product  as  1 for  the  case  when  i=n.  Also, 
r^  is  defined  only  for  l£i£n-l.  From  (2.2.10)  we  have  r^  = 
Since  we  are  given  that  E [W^  ] _<  E [ W 1 1 » therefore 
a^_>ai+1.  thus,  each  r^  will  be  smaller  than  or  equal  to  1. 

We  remark  that  since  the  conservation  law  puts  a linear 
constraint  on  the  performance  of  mean  waiting  times  for  each 
class  of  gobs,  there  are  only  n-1  degrees  of  freedom  to 
specify  a desirable  mean  waiting  times  performance.  For 
these  degrees  of  freedom,  only  n-1  independent  variables  are 
needed.  Therefore,  for  the  set  of  n control  parameters, 
there  is  one  degree  of  redundancy  and  only  n-1  degrees  of 
adjustability  can  be  achieved. 


Proof : Under  escalating  priority  scheduling,  for  any  set  of 
control  parameters  ( } , the  mean  waiting  times  { E [ Wi ] } 
must  satisfy  the  following  set  of  n equations  (from  Theorem 
1.5)  : 


(1.3.9)  [1-  V(l-%p  JE[W.]  + Z (1  )o,  E[W.  ] = W . 

k=l  “k  k 1 k=i+l  “i  k k 


Our  problem  now  is  to  determine  the  values  of  { ai  } 
given  E[W^],  l£i<n,  which  satisfy  (1.3.9)  above.  We  will 
first  solve  this  set  of  equations  for  { } . 

Let  us  define,  for  each  i,  l£i£n-l, 
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(2.2.11) 


r i = ai+l/Qi* 


Then,  we  see  that  for  any  k > i, 


i+1 


(i+2 


O' 


i+1 


k- 1 


k-1 
n r . 
k=i  3 


Similarly,  for  k < i, 


k+ 1 


k+2 

*k+l 


a • i-1 

— — = nr. 

® i— l j=k  ^ 


Thus,  (1.3.9)  can  be  rewritten  as 


i-1  n k-1 

(1-  Z (1-  n r . ) p.J  £ [W  ] + Z (1-  n r.)p  E[W  ] 
j=k  3 k k k=i+l  j=i  1 k * 


i-1 

Z 

k=l 


= W 


Shifting  those  terms  which  do  not  contain  the  factor  r-j 
to  the  right  of  the  equality  sign  and  dividing  by  E[Wi],  we 
ODtain 

i-1  i-1  n k-1 

(2.2.12)  Z o.  ( Hr)-  Z ( ovE  [W  ] /E  (W.  ] ) ( n r.) 

k=l  K j=k  J k=i+l  k k 1 j = i 3 

_ n 

= (W-  Z p E[W  ] )/E[W.]  - (1  - a.  ,)  . 
k=i+l  K * 1 1-1 


Equation  (2.2.12)  is  a set  of  n nonlinear  equations  of 
tne  n-1  variables  {r^:  l£i£n-l}  which  is  of  degree  n-1.  For 
later  reference,  the  i-th  equation  of  (2.2.12)  will  be 
defined  as  (2.2.12.i).  we  will  now  solve  this  set  of 
simultaneous  equations  for  r^,  l<i<n-l. 
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It  is  important  to  note  that  if  these  r^'s  can 
successfully  obtained,  then  we  can  use  (2.2.11)  to  obtain 
t a ^ : 

ai+l  1 ai+2  . 

a . = = — * — • - - 

1 ri  ri  ri+l 
- % / • 

For  the  extra  degree  of  freedom,  we  arbitrarily  let 


» - 1. 
n 


Tnus,  we  have 
(2 , 2, lU) 


n-1 

oi  ■ — 1 / n r,  , 
1 k=i  * 


which  is  exactly  what  was  stated  in  the  theorem. 


To  solve  for  { r^  : l_<i_<n-l},  we  first  multiply 


(2. 2. 12. n-1)  by  -rn_^  to  obtain 


(2.2.13)  - VpfclVr.lr  1+  (PnE[Kn)  / E tw^I  Ir^2 
k=l  ]=k  J 


- 1 (W-PnE(WnJ)  /Etwn-i)  - (1-P„.2))rn-1 


Equation  (2.2.12.n)  when  written  out,  becomes 


n-1  n-1  _ 

(2 . 2 . 12  . n)  E P,(  n r ) = W / E [W  ] - (l-o^] 
k=l  j=k  J 


or  , 


n-2  n-2 

k 


Ei°k<nvr3lrn-1+  pn-lrn-l  = w/E(wn>  ‘ ‘ <’n-l> 

=1  j=k  J 
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Adding  the  above  equation  to  (2.2.13),  we  obtain 


(PnE[Wnl  /E|Wn_1])rn_12+(  (W-0„E[Wni)  /EIW^I  - 
+ (1  - on  1 - W / E [Wn ] ) = 0 . 

Tms  is  a quadratic  equation  of  the  variable  only. 

We  note  that  the  coefficients  of  this  quadratic  equation  are 
the  an,  bn  ana  cn  defined  in  (2.2.7)  through  (2.2.9) 
respectively.  Thus,  r^_ ^ is  a root  of 

(2 . 2 . 6 . n)  a^X2  + bnX  + cn  = 0. 

Since  r ■,  is  defined  to  be  the  ratio  of  a and 
n-i  n 

y , and,  since  all  the  q- ■ 1 s are  positive,  we  should  expect 
n- 1 i 

r^_^  to  be  positive.  Also,  in  order  that  tn_^  be  uniquely 
defined,  we  need  to  show  that  there  is  only  one  positive 
solution  of  (2.2.6.n). 

This  is  the  case  if  we  can  show  that  a c < 0.  This 

n n 

2 

is  because  if  x-^  and  x2  are  two  roots  of  anX  + t^X  + cn  = 
U,  then  = cn^an‘  Thus,  ancn  < 0 if  and  only  if  cn/an 

< U,  which  means  that  x^  and  x2  are  opposite  in  sign.  This 
guarantees  that  either  x ^ or  x2  is  positive. 

Now,  from  (2.2.7),  it  is  easily  seen  that  an  > 0. 

Therefore,  we  need  to  show  that  cn  < 0.  From  (2.2.1)  with  I 
= (I,  2,  ...,  n-1},  we  have 
n-1 

k^pkEtV  > (l-o)Wan_!/  d-an.i)  • 
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d. 


— — — ■ 


Taking  the  difference  of  (1.3.2)  with  the  above  inequality, 
we  obtain 


Thus , 

or  , 

so 


PnElWn]  < pW  - (i-p 

= ( PW-CTn_1DW-CTn_1W+  crn_1PW)  / ( 1— «Jn_1) 

= (pW-CT^jW)  / ( l-crn_  x ) 

= PnW/  d-C^i)  • 


E[Wn]  < W/(l-  an_j) 
i-an_i<  W/E[Wn] , 
cn  = (l-an_i)  - W/E(Wn]  < 0, 


We  will  now  use  the  principle  of  induction  to  show  the 
following : 


(a)  Once  we  found  r.  , i<k<n-l,  then  r.  . must  be  a 

k l-l 

2 

root  of  the  quadratic  equation  a^X  +b^X+c^=0  where  a^,  b^ 
and  c are  defined  by  (2.2.3)  through  (2.2.5). 


(b)  There  is  only  one  positive  root  to  the  above 

2 

quadratic  equation  a^X  +b^X+c^=0. 

First  we  show  (a):  From  (2.2.12),  we  get 


i-2  i-2  n k 

(2.2.12. i-1)  £ p ( nr.)-  Z ( p^E  [W^.]  /E  [W±_1]  ) ( n r.) 

k=l  j=k  J k=l  j = i-l  J 

_ n 

= -{  (W-  £.PkE[Wk]  )/E[Wi_1]-(l-ci_2)Jri-1  . 
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Multiply  it  by  -r  , we  get 

i- 1 

i-2  i-1  i 


- E DV(  n r )+  £ (p  E[W  ]/E[W.  .])(  D r ) 

k=l  k j=k  3 k=l  k k 11  j=i~l  3 

" -f  (W-  j PkE[W  1 litj.j  . 

k=l 

Now,  aad  the  above  equation  to  (2.2.12),  we  have 

+ j. ‘»kE1\]/Elwi-i] > 1 ! rj>ri-i 

K—  1 ^ ^ 


2 (P.E[W.  ]/E[W.  ,]  ) ( n r ) 

k=i+l  * K 1 j=i  3 


= — ( (W  — £ PkE[Wk]  )/E[Wi_1l-(l-ai_2)ri_1 

k=i 

n 

+ (W-  £ PkE[Wk]  )/E[Wi)  - (l-ai_1)  . 

k=  i+ 1 


After  shifting  and  combining  terms,  we  obtain 


II  r*.—  x 2 

£ (pkE[Wk]/E[Wi_1]  ) ( n r.)rw 
p=i  j=i  J 


+ { (W  — £ PkElwk]  ) /E  [Wi_i]  - ( l-CTi_L)  Jri_1 
k=i 

n k- 1 

- £ (p,  E[W.  ]/E[W  ] ) ( nr.) 

k=i+ 1 k K 1 j=i  3 

n 

- (W-  £ p.  E[W.  ] )/E[W.  ]+(l-a  .)  = 0 

k=i+l  K 1-1 


This  is  an  equation  of  r,  ^ only,  since  all  the  rk ' s 
for  k i are  known.  Let  the  coefficients  of  the  above 
quaaratic  equation  be  a . ' , b , ' and  c . ' respectively.  We 
then  need  to  show  that  a.'  = a^  bi'  = bj_  and  ci'  = ci  as 
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defined  in  (2.2.3)  through  (2.2.5). 


Comparing  b^'  with  as  defined  by  (2.2.4),  we  can  see 
that  b^'  is  exactly  b^.  So  we  will  show  (i)  a^'  = a^  and 
(ii)  c ■ = c .. 


n 


(i)  a i' 


£ ( p-j^e (wki /e [w i—1l ) ( n rj) 
lc — i 3 i 


= p.E[W  ]/E[W.  J + 2 (p  E(W,]/E[W.  ,])(  nr.) 

1 1 k=i+l  K * j=i  3 


(E[W.]/E[W  ,]){p.+  Z (p.  E(W.  ]/E[W.]  ) ( n r • ) r . 

1 1-1  1 k=i+l  K K 1 j = i+l  3 1 


= (E[Wi]/E[Wi_1]  ) (Pi+  a^+1ri)  . 


But,  initially  we  know  that  an'  = an.  By  the  principle 
of  induction,  we  have  a^'  = a 

n k-1 

(ii)  From  (2.2.3),  a±1=  Z ( p.E  [W .]  /e  [W±]  ) ( n r.). 

k=i+l  j=i+l  J 

n 

Thus,  ci  = - ai+iri  - { (W-  Z OkE[Wk]/E[Wi]-(l-ai_1)J 

k=i+l 


- - ( (w'  k_^1pkEtV/Etwi,-'1-°l)  J 


= p.-a.  . r . -b . . 
a.  i+ 1 i i 


We  have  thus  completely  proved  (a).  We  will  next  show 
(b)  . That  is,  we  will  show  that  there  is  only  one  positive 

p 

root  of  a.  X '+b.  X+c  . =0  . 

i ii 
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f 


As  we  stated  above,  this  is  true  if  we  can  show  that 


ai  Ci  < We  have  already  seen  that  an  > U and  cn  <0,  so 

we  will  apply  the  principle  of  induction  to  assume  that  we 
have  found  a^  (i  + l<^k<_n)  to  be  positive  and  c^  (i+l<k<n)  to 
be  negative; therefore  r^  (i£k<n-i)  is  positive. 

From  (2.2.3)  it  is  easily  seen  that  a^  is  positive. 
Thus  we  only  need  to  show  that  c^  < 0. 

Instead  of  directly  proving  that  c^  < 0,  we  will  show 
the  following: 

(2.2.14)  For  i£k<_n-l,  c^  < 0 if  and  only  if 

E[wn-llanc„  * <bk+l-'k>  ' 

We  will  prove  this  by  induction. 

First,  from  (2.2.5),  we  have 


= p .-b  - Hr  + -x  /b2  - 4a  c ) 

n-1  n 2 2 J n n n 


b / b 2 

= , --#+  / Hr>  - . 


'n-1  2 V ' 2 

Therefore,  cn_1  < 0 if  and  only  if 


n n 


b / b 2 

VrfV1!’  "*  anCn 


which  is  equivalent  to 


b^  b^ 


or, 

a c < ■,  (b  -p  i ) • 

n n n-1  n n-1 

This  proves  (2.2.14)  for  the  case  k = n-l. 

Now,  we  assume  that  we  have  proved  (2.2.14)  for 

k=i+l,  i+2 , n-1.  We  will  prove  (2.2.14)  for  k=i 

Let  us  consider  the  quantity 


EIWi-i)aici  • 


From  (2.2.3)  and  (2.2.5),  we  have 


E[Wi_1Jaici  = E[Wi_1]  { (riai+1+pi)E[Wi]/E[Wi_1]  ) 


* (pi_bi+rriai+l1 


= E[Wi](-(ai+1r.)2-bi+1ai+1ri+pi(pi-b.  + 1)J 
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= E[W.  ] {-  [a  (^--(-bi+1  + /b2i+1-4ai+1c.  + 1)  ) 

l+l  v 

+ “T^1 2 + >i'b»r'i>i 


- E<wiItai+ici+rpi(bi+rI,i)i 

- Eiwiiai+ici+i-»iE[wi1  <bi+i-Bi»  • 

This  is  a recursive  relation  of  EtW)Jakck  * Therefore, 
we  can  recursively  substitute  the  quantity  in  the  right  hand 
side  of  the  above  equation  to  obtain 

n-1 

E(wi-lJaici  - E[Wn-l|ancn-  'Vr’k1  ' 

k = 1 

For  the  quantity  , we  have  already  seen  that 

a^>0.  Thus,  this  quantity  is  negative  if  and  only  if  c^<0  . 
The  equation  above  therefore  shows  that  c..  < 0 if  and  only  if 

n-1 

(2.2.15)  *[»„.,]  anon-  Z .»kEIWkJ(bk+1-Pk)  <0. 

K — 1 

This  proves  that  (2.2.14)  holds. 

We  will  now  show  that  (2.2.15)  indeed  holds.  This  will 
then  prove  that  c^  < 0 , which  will  complete  our  proof  of  (b). 
The  first  term  on  the  left  hand  side  of  (2.2.15)  is 

(2.2.16)  ElW^la^  = E(Wn.1)(pnE(Wnl/E[Wn.1l) 

X ( (l-an_L)  - w/E[Wnl  J 


= D E [W  ] ( 1-0  , )-0  W . 

n n ' n-1  n 
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The  second  term  of  (2.2.15)  is 


(2.2.17)  f . pkE[wk) (bk+l-pk) 

KL—  X 

n-l  _ n 

= E p E[W , ] { W - E p E[W.]  )/E[W.  ]-(l-a,  )-D,J 
k=i  K K j=k+l  D 3 K K * 

n-l  _ n 

= E (pk(W-  E p ,E[W  ] )-pkE[W  ) (1-a  ) J 

k=i  j=k+l  J J 

n-l  _ n-l  n n-l 

= ( E Pk)W-  E E pkp  E[W  ] - E p E[W  ] 

k=i  k=i  j=k+l  J J k=i 

n-l 

+ E °kak-lE[Wk] 
k=i 

- 'vrvi1"-”1  AE'MkJ  - Vp  PjE[Wj] 

k=i  }=i  k=x  J J 

n-l 

+ E PkCTk-lE^Wk^ 
k—  i 


= (an-l“ai-l)W'  E.pkE[Wk]  -VE.  ,E.pjpkE[Wk] 
k=i  k=i  ]=i  J 

n-l 

+ E pkak_^E[WkJ 
k — i 


n-l 


n 


- <°n-rt’i-i,w-^iV3iwk]  •k^i<0k-r0i-i)pkE[wk 

n-l 

+ kiiPkaic-lE[Wk] 


n-l 

= (»n-r0i-l»  - E»kEIWk'-(sn-r“i-l,»nE[Wn1 

K — X 


- ^(vr<’i-i»'kEt"k|t”^kViE|>lk1 
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_ n-1  n-1 

* <°n-r‘>i-l,<W-»nE‘Wn|)  ' 1 E.°kE<wk>  ' ^ . «k-ll,kE'wki 

k=i  k= 1 


+ °i-iE  »kE(V  + E »k<’k-iE(V 

K. — 1 K — 1 


- (1-a^iM  E PkE[w  ]). 

k=i 


Therefore,  (2.2.15)  becomes 


»nE<Wn'  <1-‘W-PnW-<=n-r'’i-1H"-°nE[Wn]) 


+ I1-®!.!*  ( £.PkElwkl  > 

lc=  i 


= pnE[Wn]  [ (1_CTn-l)+  ( an- } j ~W(  pn+  {an-rffi-l)  j 


+ (l-o i-i) ( £ pkE[wk]) 
k=i 


= (1-Oi_1)o  E[W  ]-W(p-CTi_1)+(l-CTi_1)  E PkE[Wk) 

k=i 


{1-ai-l)k^iDkE[Wk] ~ (1“ai-l)kE1PkE[Wk^  * 

Now,  from  (2.1.4),  when  I = [ 1 , 2 , . . . , i- 1 J , we  have 


E okE[Wk]  > (l-o)ci_1w/(l-CTi_1) 
k=  1 


Therefore 


(l-p)ai_1W  < (l-ci_1)  E pkE[Wk)  , 

1 


(l-p)ai_1W-  (1-Ci_1)  E pkE[Wk]  < 0. 

k — 1 

This  shows  that  (2.2.15)  holds,  and  thus  completes  the 


proof  of  (b) . 
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To  determine 


From  Theorem  2.3,  we  have  the  following: 
the  values  of  control  parameters  { } for  a given  set  of 

(E  IW , Elw2l,  E L Wn  J ) satisfying  (1.3.2)  and  (2.2.1), 

we  first  rename  the  class  indices  so  that  (2.2.2)  is 
satisfied,  and  then  use  the  algorithm  to  obtain  the  values 
of  the  control  parameters  { } . Thus,  any  point 

W = (E [W^] , E lW2 ],..., E [Wn ] ) lying  in  the  space  defined  by 
(1.3.2)  and  (2.2.1)  is  achievable  by  escalating  priority 
scheduling.  This  shows  that  the  feasible  performance  space 
of  mean  waiting  times  of  escalating  priority  scheduling  is 
the  space  defined  by  (1.3.2)  and  (2.2.1). 

The  following  theorem  shows  that  escalating  priority 
scheduling  covers  the  spectrum  from  FCFS  scheduling  to  fixed 
priority  scheduling. 

THEOREM  2.5.  (a)  FCFS  scheduling  can  be  achieved  from 

escalating  priority  scheduling  by  setting  the 
parameters  { } all  equal. 

(b)  Fixed  priority  scheduling  is  the  limiting 
rule  of  escalating  priority  scheduling  when  the 
ratios  between  successive  parameters  all 

approach  infinity. 


Proof:  (a)  When  we  set  all  the  parameters  { } equal, 

are  in  fact  ignoring  the  differences  among  priority  classe 
because  all  jobs  entering  the  queue  have  the  same  priorit 
increasing  rate  and  therefore  the  next  job  selected  fc 
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service  is  always  the  one  that  has  been  waiting  the  longest. 
Thus,  selecting  the  job  with  the  highest  priority  index  is 
equivalent  to  selecting  the  job  that  entered  the  queue  the 
earliest.  This  is  precisely  f irst-come-f irst-serve 

scnedul ing . 

(d)  Given  that  for  all  i,  l£i<n,  then  it  is 

impossible  to  have  any  job  from  class  i+1  with  priority 
index  larger  than  any  job  in  the  queue  from  class  i.  This 
is  because  as  soon  as  a job  from  class  i comes  into  the 
system,  its  priority  index  will  increase  at  such  a fast  rate 
that  it  surpasses  all  the  priority  indices  of  jobs  from 
class  i+1,  no  matter  how  long  they  have  waited  in  the  queue. 
Within  the  same  priority  class,  the  job  which  entered  the 
earliest  will  have  the  highest  priority  index.  Thus, 
f irst-come-f irst-serve  prevails.  Therefore,  in  this 
limiting  case,  we  have  fixed  priority  scheduling. 

To  check  this,  we  will  show  that 

(2.1.5)  lim  E[Wi]  = (1-P)  W / (1-c.  ,)  (l-o.), 

c 1—1  i 

where  c is  the  condition  that  dj/a i+ 00  > l_<i<n.  We  will 

prove  this  by  induction  on  k (from  n backwards  to  1) . 

First,  from  (1.3.10), 

_ n-1  _ 

lim  E[Wn]  = W / [1-  £ (1-0 ) Pd  = W / (1-p+p  ) = E[Wn]  . 
c k=l  n 
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p 


Next,  we  assume  that  (2.1.1)  is  true  for  all  k 
i+2,  ...,  n.  We  show  that  it  is  also  true  for  k=i. 

Now,  for  each  k>i,  it  can  be  seen  that 

i = I ) 

(l-ak_i> (l-ok)  Pk  X-°k  1-ak-l 
Thus,  for  k < i , 


lira  E[Wkl  = (l-p)w/(l-ak_1) (l~ak) 
c 


(1-P)W 

pk 


1-CT 


k-1 


So,  we  have 


lim  E[Wi) 
c 


W - T.  pk  • lim  E [Wkl 
k-i-f-1  c 

m 

1-  £ Pk 

k=l 


1_ai-l 


n _ 

{W-  2 (1-p  )W 

k=  i+ 1 


' l-o, 


1-CT 


k- 


1_ai-l 


{w  - w + 

1-a 

n 


(l-p)W, 
1-a  • 1 


(1-P)W 

- (l-CTi_1) (1-CT.)  • 
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The  above  theorem  states  that  the  class  of  escalating 
priority  scheduling  covers  the  spectrum  from  first-come- 
first-serve  scheduling  with  no  discr iminatin  to  fixed 
priority  scheduling  with  FCFS  rule  to  determine  the  order  of 
service  for  jobs  within  the  same  class. 

2 . 3 Interpretations  of  Boundary  Points 

In  Section  2.2,  we  showed  that  the  feasible  performance 
space  of  mean  waiting  times  for  escalating  priority 
scheduling  is  characterized  by  (1.3.2)  and  (2.2.1).  This  is 
an  open  set  consisting  of  the  interior  points  of  the  space 
defined  by  (1.3.2)  and  (2.1.4).  in  this  section,  we  study 
the  interpretation  of  the  boundary  points  of  the  space 
defined  by  (1.3.2)  and  (2.2.1). 

In  section  2.1,  it  was  shown  that  the  extreme  points  on 
the  boundary  represent  mean  waiting  times  behaviors  of 
different  orderings  of  priority  classes  under  fixed  priority 
scheduling.  By  Theorem  2.5  (b) , these  points  correspond  to 
limiting  rules  of  escalating  priority  scheduling  when 

“i/ai+1',“  for  a11 

We  will  now  investigate  the  meanings  of  non-extreme 
boundary  points  for  a 3-class  queueing  model  using  a three 
dimensional  geometrical  representation. 
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In  figure  2.3,  the  points  Q.  's  are  the  points 

13k 

corresponding  to  mean  waiting  times  performance  of  fixed 
priority  scheduling  with  highest  priority  given  to  class  i, 
the  next  highest  to  class  j,  and  lowest  to  class  k.  The 
point  P is  (W,  W,  W)  ana  corresponds  to  FCFS  scheduling. 

The  interior  points  of  the  hexagon  Q 12;^  2 1 3^  2 3 1^  32 11^  3 12^  1 32 
are  the  open  space  defined  by  (1.3.2)  and  (2.2.1).  They  are 
the  feasible  performance  space  of  mean  waiting  times  under 
escalating  priority  scheduling. 


Figure  2.3  Boundary  Points  of  Performance  Space  in 
3-Class  Escalating  Priority  Scheduling 

Without  loss  of  generality,  we  consider  the  performance 
space  of  mean  waiting  times  for  the  priority  ordering  that 
gives  class  1 the  highest  priority,  class  2 next,  and  class 
3 the  lowest.  From  Theorem  2.3,  this  space  is  the  area  in 
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Thus, 


it  is 


the  hexagon  such  that  ElW^]  < E L W 2 ] < EtW^]. 
the  area  bounded  by  P,R,Q123,S  in  Figure  2.5, 
PR  is  defined  as  E[W^]=ElW2]  and  PS  is 
E [W2  J =E  [W3J  . 


where  the 
defined 


line 

as 


Let  us  first  consider  the  parametric  change  of 
parameters  in  escalating  priority  scheduling  with 
= (a,a,ak)  for  k:l-»0  (a  is  any  positive  constant).  Since 
0^=02'  we  must  have  E [W-jJ  =E  [W21  • We  note  that  initially, 
when  k=l,  we  are  at  the  point  P=(W,  W,  W) . Now,  when  we 

decrease  the  value  k,  starting  from  1,  the  point 
(E [ W^ ] , E [W2 ] , E [W3 ] ) moves  from  P along  the  line  PR  towards 


R. 

This  means  chat 

E [ W 3)  is  increasing  because 

the  value 

of 

Q 3 

is  decreasing. 

Hence,  when  k=0 , we  must  be 

at  point 

R, 

and 

this  point 

represents  the  limiting 

behavior 

of 

escalating  priority  scheduling  with  ®1=0'2  and  a2^a3  _>c0  • 


This  limiting  situation  can  be  thought  of  as  fixed 
priority  scheduling  applied  to  two  priority  groups:  The 
high  priority  group  consisting  of  classes  1 and  2;  the  low 
priority  class  group  consisting  of  class  3.  Within  the  high 
priority  group,  no  discrimination  is  applied  between  class  1 
and  class  2 jobs. 


Next,  consider  the 
(a(k+b(l-k) ) ,a,ak) , where 


parametric  setting  (<^,<*2*03)  = 

a>0  and  b^l . Again,  when  we  start 


with  k=l,  we  are  at  point 
relative  discriminations 
between  class  2 and  class 


P.  Now,  as  k decreases,  both  the 
between  class  1 and  class  2,  and 
3 increase.  This  is  because  a}/a2 
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= k + b ( 1 — k ) _>  1 and  a^/ot^  = 1/k  > 1.  This  means  that  the 
point  is  moving  from  P away  from  PR  and  toward  the  boundary 

RQ  because  E[W9]  and  E[W9]  are  both  increasing. 

12  3 c J 

Therefore,  when  k approaches  U,  the  point  must  lie  on  the 
boundary  R<2^23  ana  represents  the  limiting  setting  of 
Ol/a2=b  and  a,2//a3~,0°  * 

This  limiting  situation  can  again  be  thought  of  as 
fixed  priority  scheduling  applied  to  two  groups:  the  high 

priority  group  consisting  of  classes  1 and  2 and  the  low 

priority  group  consisting  of  class  3 only.  Now  however, 

within  the  high  priority  group,  class  1 and  class  2 jobs 
compete  for  service  using  escalating  priority  scheduling. 

We  will  now  investigate  changing  the  value  of  b.  When 
b=l  and  k -*  0 , a^/a2  = i-  and  a2/'a3~,a>  ’ ^now  from  our  first 

example  that  this  parameter  setting  corresponds  to  the 
boundary  point  R.  When  b and  k -»  0 , /a2  ~*a>  and 

q'2/q'3'+c°  • From  Theorem  2.5  (b)  we  know  that  this 
corresponds  to  the  extreme  point  Qi23*  Therefore,  when  b 
varies  between  1 and  < 00  , l<Lot^/a2<co  and  °2/a3 ->co  and  thus 
the  corresponding  limiting  point  lies  on  the  line  segment 
RQl23. 

Similarly,  the  line  segment  SQ^23  represents  the 
limiting  points  when  o^/ 0'2~,co  and  a2/a3=b  for  l_<b<°°  . This 
can  be  seen  using  the  parametric  setting 
( 0'^o'2,a^  - (a/k  , a , a ( k+ (1-k ) /b)  and  using  the  arguments 
presented  above  for  line  segment  RQ^23. 


61 


Therefore,  non-extreme  points  on  the  boundary  can  be 
thought  of  as  representing  the  mean  waiting  times 
performance  of  priority  scheduling  rules  with  a "mixture"  of 
fixed  priority  scheduling  and  escalating  priority 
scheduling.  We  now  formally  define  this  new  rule  of 
scheduling : 

DEFINITION.  In  multiclass  queueing  systems,  a scheduling 
rule  is  called  mixed  priority  scheduling  of 
escalating  priority  and  fixed  priority,  or,  MEFP 
scheduling  in  short,  if  it  separates  the  arrival 
classes  into  several  priority  levels  such  that 
fixed  priority  scheduling  is  applied  between 
different  levels,  and  escalating  priority 
scheduling  is  applied  within  levels. 

This  scheduling  mechanism  is  shown  in  Figure  2.4. 
Operationally,  let  the  n arriving  classes  be  grouped  into  m 
levels,  eacn  level  i consisting  of  n^  priority  classes.  For 
each  priority  class  (i,j),  there  is  a control  parameter 
associated  with  it.  Jobs  within  the  level  i compete  for 
service  with  priorities  increasing  witn  rate  a — . When  the 
server  becomes  free,  it  searches  from  the  top  level 
downwards  to  find  the  first  level  with  at  least  one  nonempty 
queue.  It  then  select  the  job  within  this  level  with  the 
highest  priority  index  for  service. 
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Figure  2.4  MEFP  Scheduling  Mechanism 

Notice  that  in  this  scheduling  rule,  within  each  level 
i,  we  have  n^-1  degrees  of  freedom  for  controlling  the 
parameters  { }.  Thus,  the  total  degrees  of  freedom  for 
controlling  this  scheduling  is  n-m. 

Kleinrock  first  studied  this  limiting  situation  of 
escalating  priority  scheduling  in  [KLEI66]  and  called  it 
Strict  and  Lag  Priority  Mixture  (SLPM) . He  derived  mean 
waiting  times  behavior  for  M/M/1  queueing  systems  under  SLPM 
scheduling . We  note  that  his  results  for  MEFP  (SLPM) 
scheduling  are  also  true  for  M/G/l  queueing  systems  for  the 
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reasons  given  in  Section  1.3  that  his  results  of  escalating 
priority  scheduling  can  be  extended  from  M/M/1  queueing 
systems  to  M/G/l  systems. 


vie  now  summarize  the  meanings  of  the  boundary  points  of 
the  feasible  performance  space  of  mean  waiting  times  under 
escalating  priority  scheduling: 

(a)  Extreme  points  on  the  boundary  represent  mean 
waiting  times  of  priority  classes  under  fixed  priority 
scneduling . 

(b)  Non-extreme  points  on  the  boundary  represent  mean 
waiting  times  of  priority  classes  under  MEFP  scheduling. 

Before  we  conclude  this  chapter,  we  remark  that  since 
boundary  points  are  limiting  behaviors  of  escalating 
priority  scheduling,  we  can  modify  Theorem  2.4  to  obtain  a 
set  of  values  for  escalating  priority  scheduling  that  can 
approximate  the  performance  of  fixed  priority  and  MEFP 
scheduling.  This  is  as  follows: 

Given  a set  of  mean  waiting  times  performance  of  either 
fixed  priority  or  MEFP  scheduling,  they  correspond  to  a 

boundary  point  of  the  space  defined  by  (1.3.2)  and  (2.1.4). 

, 2 
wnen  we  solve  the  quadratic  equation  aGt  + b^X  + = 0 

following  Theorem  2.4,  instead  of  having  a unique  positive 

root,  we  will  have  a unique  non-negative  root.  In  this 

case,  (2.2.10)  can  not  be  directly  applied  to  determine  the 

values  of  { } because  the  denominator  of  the  expression  in 
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(2.2.10)  is  zero.  However,  if  we  recall  from  (2.2.11)  that 
r^  is  defined  as  the  ratio  then  r^=0  means  that  the 
value  or^/a^  ^ must  approach  infinity.  Thus,  in  order  to 
obtain  a set  of  parameters  such  that  °y/Qi+ 1"*00/  we  can 
replace  r^=0  by  r ^=e  where  e is  a very  small  positive 
quantity  so  that  /ai+i  =l/e  will  be  large.  Using  this 


substitution,  we  can 

then  obtain 

a set 

of  positive  r 

^ 1 s and 

therefore 

(2.2.10) 

of  Theorem 

2.4  can  be  used  to 

obtain  a 

set  of  values  { } . 

Therefore , 

using 

this  set  of 

values 

for  the 

control 

parameters 

in 

escalating 

pr ior ity 

scheduling , 

fixed  priority  and 

MEFP 

schedul ings 

can  be 

approximated . 
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CHAPTER  3 


OPTIMIZATION  OF  ESCALATING  PRIORITY  SCHEDULING 

3 . 0 Introduction 

In  Chapter  2,  we  showed  that  escalating  priority 
scheduling  can  cover  the  spectrum  of  scheduling  from  FCFS 
scheduling  to  fixed  priority  scheduling  by  adjusting  the 
control  parameters  { a ^ } and  that  the  feasible  performance 
space  of  the  EtW^J's  is  defined  by  (1.3.2)  and  (2.2.1).  In 
this  chapter,  we  present  algorithms  for  determining  the 
values  of  { E [ ] } to  optimize  various  cost  functions.  We 
then  can  use  the  algorithm  presented  in  Theorem  2.4  to 
obtain  the  values  of  the  control  parameters  { } in 

escalating  priority  scheduling  to  give  the  values  of  mean 
waiting  times  determined  from  these  optimizing  algorithms. 

We  will  develop  optimization  algorithms  for  two  general 
cost  objective  functions.  Each  of  these  general  functions 
has  several  special  cases  of  interest.  Two  new  variables, 
s,  and  c.,  are  used  in  these  general  functions.  s . is  a 

IX  1 

constant  for  class  i and  its  meaning  differs  for  each  of  the 
various  cases.  Cj_  represents  the  appropriate  cost  of  each 
job  in  class  i for  the  objective  function  being  used.  The 
two  general  objective  functions  are  the  following: 

n 

(a)  Minimize  £ \ck(E  [Wk] -sk)  , (m>l) 

k — 1 
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and 


(b)  Minimize  max  { x,  c,  (E  [ W 1 -s,  ) ; l<k<n). 

k k k k — — 


We  will  now  present  some  of  the  interesting  special 


cases  of  the  general  objective  functions. 


Case  .1.  For  objective  function  (a)  with  m=l  and  s^=-E[S^], 
we  have: 


Minimize  £ X^c^ (E [W^j +E [S^] ) , 
k — 1 

where  c^  is  the  cost  of  the  time  spent  in  the  system  per 
unit  time  for  each  job  from  class  i.  This  objective 
function  minimizes  the  expected  total  cost  of  the  average 
time  jobs  spend  in  the  system  (waiting  plus  service  time) . 

Case  2.  For  the  objective  function  (a)  with  m=l  and  s^O 
for  all  i,  we  have: 


Minimize  E X,  c,  E [W  ] , 
k=l  K K K 

where  c^  is  the  cost  of  waiting  per  unit  time  for  each  job 
from  class  i.  This  objective  function  minimizes  the 
expected  total  cost  of  the  average  time  jobs  spend  waiting 
for  service. 


Case  3>*  For  the  objective  function  (a)  with  m=2  and 
s^=-E (S • ] , we  have: 


Minimize  E Xkck (E [ Wk] +E [Sk] ) , 

k — 1 


where  c ^ is  the  cost  of  the  time  spend  in  the  system  per 
unit  time  squared  for  each  job  from  class  i.  This  objective 
function  minimizes  the  expected  total  cost  of  the  square  of 
the  average  time  jobs  spend  in  the  system. 

Case  4.  For  the  objective  function  (a)  with  m=2  and  s^O  for 
all  i,  we  have: 

n 2 

Minimize  £ \ c E [W  ] , 

k=l  * K K 

where  c,  is  the  cost  of  waiting  per  unit  time  squared  for 
each  job  from  class  i.  This  objective  function  minimizes 
the  expected  total  cost  of  the  square  of  the  average  time 
jobs  spend  waiting. 

Case  For  the  objective  function  (b)  , if  we  interpret  s^ 

as  the  slack  time  allowed  for  jobs  from  class  i,  then 
E[W^l~s^  is  the  expected  lateness  of  a job,  and  c^  is  the 
cost  per  unit  time  for  each  job  late  in  class  i.  Thus  this 
objective  function  minimizes  the  expected  lateness  cost  of 
the  average  lateness  of  all  jobs  in  the  class  having  the 
maximum  expected  lateness  cost. 

From  optimization  theory,  optimal  solution(s)  exist  for 
any  closed  and  bounded  feasible  solution  domain.  Since  the 
feasible  performance  space  of  mean  waiting  times  of 
escalating  priority  scheduling  is  an  open  space  defined  by 
(1.3.2)  and  (2.2.1),  we  will  add  the  boundary  points  of  this 
space  to  the  solution  domain  to  guarantee  that  optimal 
solution(s)  exist.  Thus,  for  our  optimization  problems,  the 
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constraints  are  (1.3.2)  and  (2.1.4).  For  these  problems,  if 


the  optimal  solution(s)  obtained  falls  on  a boundary,  we  can 
use  the  modification  of  Theorem  2.4  described  in  Section  2.3 
to  obtain  a set  of  { J that  will  approximate  the  optimal 
mean  waiting  times. 


In  the  following  sections.  Section  3.1  discusses 
optimization  of  (a)  with  m=l  and  (b)  and  Section  3.2 
discusses  the  optimization  of  (a)  with  m>l.  In  Section  3.3, 
we  present  a series  of  examples  to  illustrate  the 
optimization  techniques  described  in  Sections  3.1  and  3.2. 

3 . 1 Optimization  with  Linear  Cost  Functions 

In  this  section,  we  discuss  optimizing  the  following 
objective  functions: 

n 

(3.1.1)  Minimize  E \ c (E[W  ]-s.) 

K K iC  K. 

and 


(3.1.2)  Minimize  max  { \Cy.(E  [W^ -sk)  ; l<k<n} 
subject  to 

(1.3.2)  0 jE  [W  x]  + o2E[W2]  + ...  + PnElV^]  = pW, 
and 

(2.1.4)  VlcN  E pvElwvl  > ( 1 - P)  ( E PV)W/(1-  E p J . 

k6I  K - k6l  k k€l  k 

We  first  aiscuss  (3.1.1).  For  this  objective  function, 
we  have  a bona  fide  linear  programming  problem.  Thus,  an 
optimal  solution  can  be  obtained  by  the  simplex  method. 
From  the  theory  of  linear  programming,  if  a single  optimal 
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solution  exists,  it  must  occur  at  an  extreme  point.  When 
multiple  optimal  solutions  exist,  they  must  be  convex 
combinations  of  optimal  extreme  points. 

A solution  to  this  optimization  problem  can  be  obtained 
directly  without  using  the  simplex  method  because  we  know 
that  extreme  points  represent  fixed  priority  scheduling. 
We  recall  from  Theorem  1.4  that  for  fixed  priority 
scheduling,  to  minimize 
n 

(1.3.7)  £ >-kckE  [Wk]  , 

k — 1 

the  optimal  ordering  of  priority  classes  follows  the 
descending  order  of  If  for  some  i and  j,  c ^ = p,  c , 

then  multiple  optimal  solutions  exist  and  the  ordering  of 
these  two  classes  is  unimportant. 


Now,  since  the  objective  function  (3.1.1)  can  be 
rewritten  as 


n 


n 


£ \Rck(E  [WJ -sv)  = £ \vcvE(Wv]  - £ \vcvsv, 


k k 


k k k' 


k=i  ~ 'v  ~ k=l  J'  ~ k=l 

optimizing  (3.1.1)  is  equivalent  to  optimizing  (1.3.7)  when 
the  two  problems  have  the  same  solution  domain.  This  is 
because  the  last  summation  term  is  a constant  independent  of 
the  E [Wi ) 1 s . 


Even  though  the  solution  space  for  optimizing  (3.1.1) 
contains  points  other  than  the  extreme  points  that 
correspond  to  fixed  priority  scheduling,  we  can  temporarily 
restrict  ourselve  to  this  set  of  extreme  points  to  obtain 
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initial  optimal  solution(s).  Under  this  condition,  we  can 
use  Theorem  1.4  to  obtain  initial  optimal  solution(s)  and 
then  take  convex  combinations  of  all  the  initial  optimal 
solution(s)  to  obtain  multiple  optimal  solutions.  A summary 
of  the  above  discussion  is  the  following: 

To  optimize  (3.1.1),  first  order  the  p^cVs  in 
descending  sequence. 

Case  (a):  If  no  two  M c.'s  are  equal,  then  the  optimal 
solution  to  (3.1.1)  is  unique.  Denoting  the  class  indices 
for  the  descending  sequence  of  p^c^  as  1',  2',  ...»  n',  then 
from  Theorem  1.3,  we  have 

(3.1.3)  E (Wi . ) * = (1-p)  W / (l-CTi_1 . ) (l-ai,). 

Case  (b) : If  some  of  the  p^c^'s  are  equal,  then  for  all 
non-ascending  orderings  of  P^c^,  we  first  use  (3.1.3)  to 
obtain  all  the  optimal  solutions  within  the  domain  of  fixed 
priority  scheduling.  Then,  the  solutions  to  (3.1.1)  are  the 
set  of  all  convex  combinations  of  the  initial  optimal 
solutions . 

Next,  we  discuss  optimization  of  (3.1.2)  subject  to 
(1.3.2)  and  (2.1.4). 

To  solve  this,  we  employ  the  following  argument:  Given 
an  objective  function  f(x)  with  solution  domain  X,  instead 
of  searching  directly  through  the  whole  domain  of  X for  the 
minimum,  we  decompose  X into  several  subregions  and  first 


determine  the  minimum  within  each  subregion.  Then,  the 
optimal  solution  to  the  original  problem  is  the  solution  of 
the  sub^pace  tnat  has  the  smallest  value  for  the  objective 
function  among  tne  minimum  values  found  for  all  the 
subspaces.  Mathematically,  we  have: 

(3.1.4)  min  f (x)  = min  min  f (x)  if  X = U X. . 

x£X  i£l  x€X . i€l  1 

i 

Now,  for  our  optimization  problem  (3.1.2),  we  decompose 
our  solution  space  R defined  by  (1.3.2)  and  (2.1.4)  into  n 
suoregions  R1f  R„,  ...,  R such  that  each  region  R.  is 
defined  by  (1.3.2),  (2.1.4)  and  the  following  set  of  n-1 

constraints : 

(3.1.5)  \ .c  . (E tW  . ] -s  . ) ^ \ c (E [W  ] -s, ) for  all  k^i. 

11  1 1 K K K K 


We  note  that  for  each  i,  (3.1.5)  is  a different  set  of 
linear  inequalities  and  it  means  that  for  each  point  W = 
(E  [W ,E  [W2J  , . • . , E [WnJ  ) in  R^,  if  we  consider  the  values 
*1C  l(E  t W i]  -s  , i-2  °2  ( E [ W 2 J -s  2^  ' • • • c^(E  [W^] -sn)  , then 

\ iC i (E [ W i] -s  ) is  the  largest  among  them.  That  is,  for  each 
W £R  i , 

max  { \kck(E[Wk]-sk)  ;l<k<n}  = \±c  i(E  [WJ -s  i)  . 


Now, 


since  for  each  point  W in  R,  if  we  consider  the  values 


\1ci(E  l W L] -s  1)  , *2C2(E  t w 2) -s  2)  , . 


\ c (E [W  ]-s  ),  then  one 
n n n n 


of  them  must  be  the  largest.  If,  say,  \ c (E [W  1 — s ) is  the 

3 3 3 3 

largest  among  the  above,  this  means  that  the  point  W must 
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1 ie  in  reg ion  R . . 

3 

the  regions  R^. 


Thus,  each  point  in  R must  lie  in  one 
This  proves  that 


of 


R-lUR2U  ...  uRn=R. 


We  now  define  f (W)  = max  {Xkck  (E  -sk)  ' 1lkln}  . 

Then  our  minimization  problem  (3.1.2)  becomes  min  f (W) . 

W£R 

From  (3.1.4),  this  problem  is  then  equivalent  to  min  min 

IS i<n  W£R . 

f (W) , which  is 

(3.1.6)  min  min  max  { X,  c,  (E  IW,  ] -s,  ) ; l_<k_<n}  . 

l<i<nW€Ri 

Since  for  each  point  W within  region  R^ , we  have 

max  { Xkck (E [Wk] -sk) ; l<k<n}  = X^c^ (E [ W^ ) -s^ ) , 

substituting  this  into  (3.1.6),  we  obtain 

min  min  \ . c . (E [W . ] -s . ) . 
l<iSn  R.  1 1 1 1 

l 

The  above  shows  that  we  can  solve  (3.1.2)  by  the 
following  two-step  procedure: 

Step  (1):  For  each  i,  l£i<n,  solve  the  following: 

(3.1.7)  Minimize  \ . c . (E [W . ] -s . ) 

ii  ii 

subject  to  (1.3.2),  (2.1.4)  and  (3.1.5). 
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Step  (2):  Select,  among  all  the  optimal  solutions  obtained 
from  step  (1),  the  one  with  the  minimum  objective  value. 
This  solution  is  then  the  optimal  solution  to  the  original 
problem  (3.1.2) . 


We  remark  that  in  the  step  (1)  above,  n linear 
programming  problems  need  to  be  solved,  each  one  using  a 
subregion  of  the  original  solution  space  defined  by  the 
constraint  sets  (1.3.2),  (2.1.4)  and  (3.1.5).  For  each 
different  i,  (3.1.5)  defines  a different  region  . It  is 
possible  that  under  the  partition  used,  some  subregions  may 
be  empty  so  that  feasible  solutions  for  these  problems  do 
not  exist.  However,  since  at  least  one  of  the  regions  R^  is 
always  nonempty,  a solution  to  our  original  problem  exist. 

Before  we  conclude  this  section,  we  point  out  the 
following  interesting  observation:  For  the  optimization  of 
(3.1.2),  if  the  optimal  solution  falls  in  the  interior  of 
the  solution  domain,  then  the  following  relation  must  hold: 

(3.1.8)  \ ^c  ^ (E  Sj_)  ^2^  ~s2^  = " * * = ^ncn^  ^ ~sn^  * 

This  is  because  if  W*  = (E  [W-J  *,E  [ W 2 ) *,  . . . ,E  [Wn]  *)  were 
an  optimal  solution  but  did  not  satisfy  (3.1.8),  then  if  we 
rename  the  class  indices  as  1 ' , 2',  ...,  n'  such  that 

(3*i-9)  ^l'C].,  (E(W^,]-s^,)  ^2  ^2 1 (E  [W2  1 1 "S2  i ) 

> . . . > \n.cn,  (E[Wn,]-sn.)  , 
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then  at 

least  one  of 

the 

above 

">"  must 

have 

strictly 

greater 

relation  ">". 

Let 

class  i1 

1 and  i+1 ' 

be  the 

smallest 

indices  such  that 

ii,ci,  (E  [Wi,  ]-si,  ) > *i+i«ci+1.  (E  [Wi+1,  ] ~si+1. ) . 
That  is , we  have 

\1,c1, (E [W  x , ] -s  L , ) = X2,c2, (E [W2 , ] -s2 . ) 

= ...  = \^,c • , (E [Wi , ] -s i , } . 

Then,  from  the  conservation  law  of  mean  waiting  times, 
it  is  possible  to  further  reduce  the  E[W^,]'s,  l<.k<Ci,  by 
increasing  E[W  ],  i+l<k<n  so  that  X ^ , c ^ , (E [W^ , ] -s  ^ , ) will 
be  decreased.  This  contradicts  our  assumption  that  W*  is 
optimal . 

We  note  that  the  above  argument  does  not  apply  to  the 
boundary  points  because  if  we  try  to  reduce  the  values  of 
some  of  the  E[w^j's,  then  they  may  be  forced  off  the 
boundary  and  become  infeasible. 

The  above  shows  that  if  we  temporarily  neglect  the 
boundary  constraints  (2.1.4),  then  (3.1.8)  can  be  used  to 
obtain  a solution  to  the  relaxed  problem  (3.1.2)  with  only 
the  constraint  (1.3.2).  Thus,  for  our  original  problem, 
instead  of  following  the  aforementioned  procedure  of  solving 
n linear  programming  problems,  we  can  first  solve  (3.1.8) 
and  (1.3.2)  to  obtain  a set  of  {E[W^]},  and  then  test  to  see 
if  the  constraint  set  (2.1.4)  is  satisfied.  If  the  solution 


l 
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does  satisfy  the  constraint  set  (2.1.4),  then  it  is  an 
optimal  solution  to  our  original  problem.  However,  if  some 
of  the  constraints  in  (2.1.4)  are  violated,  then  we  must  use 
the  original  procedure  to  obtain  a solution. 

We  now  return  to  the  problem  of  solving  (3.1.8)  and 
(1.3.2)  for  {E [W^] > . First,  let  the  common  value  of  (3.1.8) 
be  t,  so  that 


E [ W± ] = si  + t / Xici. 


Substitute  this  into  (1.3.2),  we  have 


n 


E pk(sk  + t/xvcv>  = PW» 


k=l 


k k' 


n 


or  , 

Thus,  we  have 
(3.1.10) 


t = (p  W - E p s ) / ( E 1/n.c.  ) 
k=l  k=l  * * 


n 


E[W.]  = s^^  + (pW-  E Pksk)/(^ici  S l/iikck) 

k — 1 k — 1 


This  is  then  the  solution  to  the  relaxed  problem 
(3.1.2)  with  constraint  (1.3.2)  only.  If  it  also  satisfies 
(2.1.4),  then  it  is  the  optimal  solution  to  (3.1.2). 
However,  if  it  fails  to  satisfy  all  the  inequalities  in 
(2.1.4),  then  we  must  use  our  original  solution  procedure. 


We  now  illustrate  this  with  a two-class  priority 
queueing  example. 


76 


Figure  3.1  Minimize  max  {x^c^  (E  1 -sk^  ' 


In  Figure  3.1,  our  solution  space  for  optimization  is 
the  line  segment  ^12^21'  Using  (3.1.10)  and  the  given 
positive  values  for  s^  and  s2  , we  have: 


Thus,  if  Vi  = (EtWjJ,  E [W2 1 ) lies  in  between  Q 12  and 
Q21,  then  it  is  an  optimal  solution.  However,  if  it  does 
not,  then  we  need  to  use  the  two-step  procedure  that  we 
described  earlier. 

In  this  example,  the  point  S is  defined  as  (s s ^ . 
It  is  interesting  to  note  the  following  behavior  of  the 
objective  function:  At  the  point  S,  E[W^]  = s^  and  E [ W2 ] = 
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s2 , so  that  the  objective  value  is  0.  At  point  W,  the 
objective  function  assumes  some  positive  value,  say  b,  so 
that 

max  tx^c.^  (E  [W^]-s^)  ; k = l,2}  = b. 

Thus,  the  contour  of  all  points  satisfying  the  above 
equation  is  the  two  "half  lines"  passing  through  the  point 
W that  are  parallel  to  the  two  axes. 


3 . 2 Optimization  with  Nonlinear  Cost  Functions 

In  this  section,  we  discuss  priority  scheduling 
subject  to  non-linear  cost  of  delay.  Our  problem  is  the 
following : 


(3.2.1) 
subject  to 


Minimize  E \kck  ( ElW^  -sk 


, m 


(m>l ) 


(1.3.2)  p^lWjl  + p2E  [W2  ] + ...  + PnE[Wn]  = pN, 


and 


(2.1.4)  VIcN  E okE[Wk]  > (1-p) ( S PV)W/(1-  £ Pt 


k6l 


k€l 


k€l 


First  we  discuss  the  case  when  m=2.  With  a quadratic 
objective  function  and  linear  constraints,  we  have  a 
quadratic  programming  problem. 
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This  quadratic  constrained  optimization  problem  can  be 
solved  based  on  a theorem  developed  by  Kuhn  and  Tucker 
[KUHN51],  which  we  will  now  state  in  the  following.  (The 
proof  of  this  theorem  can  be  found  in  most  nonlinear 
programming  books.) 


THEOREM  3.1.  Consider  the  problem 

maximize  Zo  = f (Z^,  Z 2»  . ..,  Z n) 
subject  to 

(1)  g ,^(  Z L 2»  • • • » Z ^)  — ^ i — 1»2,  ...  (^  ; 

(2)  n j (Z  ir  Z 2»  • • • » Z ^)  ^ for  j — 1 , 2 , . . . ,m, 

where  f,  g ^»  h ^ , are  all  continuously 

differentiable  functions.  Then, 

Z 

solution  to  the  non-linear  optimization  problem 
only  if  there  exists  £ + m multipliers 


Z*  (Z  v Z 2» 


. .,  u n,  can  be  an  optimal 


(3.2.2) 

x i»  x 2»  ...»  , and  y^  » y2  » • • • » 

that  the  following  conditions  are  all 

£ dg ■ m sh. 

SZk  i=l  1 SZk  j=l  3 SZk 

(3.2.3) 

g i (z  *)  =o 

for  i = 1 , 2,  . . . , l , 

(3.2.4) 

n (z*)  > o 

for  j = 1,  2,  ...»  m. 

(3.2.5) 

y . h . (Z* ) =0 

3 3 

for  j = 1 , 2 , . . . , 

(3.2.6) 

yj  < U for 

j — 1»  2,  «.«,  m. 

1 m 


such 
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Furthermore 


these  conditions  (called 


Kuhn- 


Tucker  conditions)  are  also  sufficient  if  the 
following  are  satisfied: 

(3.2.7)  f ( Z ) is  concave, 

(3.2.0)  g i( Z ) is  linear,  for  all  i = 1,  2,  ...,  i, 

(3.2.9)  h_.(Z)  is  concave,  for  all  j = 1,  2,  ...,  m. 

In  order  to  apply  this  theorem,  we  use  the  following 
matrix  notations: 

Let  A = (Aij)  be  a ( 2n  — 2 ) x n matrix  defined  by 

(3.2.10)  Aij  = Pj  if  0 < (i  mod  2-^)  < 2^  1 , 

* 0 if  2^-1  < (i  mod  2^)  < 2j. 

where  the  notation  "x  mod  y”  is  used  to  represent  the 
remainder  of  x divided  by  y. 


Let  b = 

(b1# 

, • • 

• / 

T 

b n ) 
2-2 

be  defined 

by 

n 

n 

(3.2 

• 11) 

bi  = 

(1  - p) 

w 

( E A ) 
k=l 

/ (1  - 2 
k=l 

A • , 
lk 

also 

, denote 

W = 

(E [W1] , 

E 

[w2],  ... 

, E [Wn] ) , 

and 

£.  = 

( pr  P2 

/ 

. . • , Pn  ) 

. 

Then,  if  we  let 

(3.2.12)  g (W)  = pT  W - p W,  and 


(3.2.13) 


h ( W)  = A W - b, 

we  see  that  constraints  (1.3.2)  and  (2.1.4)  can  be  written 
as  g (W)  = u and  h(W)  _>  U respectively. 

Now,  in  order  to  change  our  minimization  problem  to  a 
maximization  problem,  we  define  f(W)  to  be  the  negative  of 
our  objective  function.  That  is,  we  let 


f (W)  = - I \k  ck  ( E [Wk]  - sk  ) . 

k=l 

If  we  denote 


s - (Sj_,  s2  , • • • r sn)  , 


and  let  D = (Dij)  be  an  n x n matrix  defined  by 


Dij 


'ici 


if  i = j , 


if  l T j . 


then,  f(W)  can  be  written  as 


(3.2.14)  f (W)  = sr  D S - 2 sT  D W + WT  D W. 

From  the  definitions  of  the  functions  f,  g,  h above,  it 
is  easily  seen  that  conditions  (3.2.7)  through  (3.2.9)  are 
satisfied.  Thus,  the  Kuhn-Tucker  conditions  (3.2.2)  through 
(3.2.6)  become  sufficient  in  providing  optimal  solutions. 

We  will  now  derive  these  Kuhn-Tucker  conditions  for  our 
problem  and  then  see  how  the  solution  of  this  set  of 
relations  can  be  obtained  by  applying  a modified  linear 
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programming  method  using  the  artificial  variable  technique. 

First,  substitute  (3.2.12)  through  (3.2.14)  into 
(3.2.2),  we  have : 

T T T T 

- 2 s D + 2 W D - x£  - £ A = 0. 

Let  x = xL  - x2  be  the  difference  of  two  non-negative 

variables  and  let  £.'  = - y_,  substituting  into  the  above 

T 

equation  ana  taking  the  transpose,  keeping  in  mind  that  D 
D,  we  obtain 

2 DW-px^  + px2  + ATy ' = 2 D s. 

Denote  x/  = (x-^,  x2)  , then  in  matrix  notation,  the  above 
equation  becomes 

(3.2.15)  ( 2 D | -p,£  I AT  ) ( W I x'  I y'  ) = 2 CL  s.. 

Next,  substitute  (3.2.12)  into  (3.2.3),  we  have 

(3.2.16)  PT  W = p W . 

Then,  substitute  (3.2.13)  into  (3.2.4),  we  obtain 

(3.2.17)  A W > b. 

Now,  if  we  let  T = h(W)  = A W - b be  non-negative  slack 
variables  for  the  inequalities  (3.2.17),  then  (3.2.5) 
becomes 
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y h (W)  = y .T  . = - y’  T . = 0 
- - ~ 3 3 3 3 

or,  equivalently, 

(3.2.16)  y'.  T.  = U,  for  j = 1,  2,  ...,  2n-2. 

3 3 

We  recall  that  in  our  problem  of  determining  optimal 
W = (E  [W  J , E[W0],  ...»  E[Wn]),  each  E [ W • ] is  required  to  be 
positive.  Also,  x'  is  defined  to  be  non-negative.  By  the 
definition  of  y.'  ana  (3.2.6),  is  also  non-negative. 

Thus,  we  have 

(3.2. 19)  W,  x 1 , y/ , T > 0. 

Therefore,  the  Kuhn-Tucker  conditions  for  our  problem 
can  be  summarized  as  (3.2.15)  through  (3.2.19). 

To  solve  the  simultaneous  equations  and  inequalities 
(3.2.15)  through  (3.2.19),  we  apply  the  artificial  variables 
technique,  and  introduce  to  each  linear  equality  and 
"greater  or  equal"  constraints  a non-negative  artificial 
variable  to  formulate  a pseudo  minimization  problem  with  the 
sum  of  introduced  artificial  variables  as  the  objective 
function.  According  to  this  technique,  if  the  minimum  value 
of  this  pseudo  ob3ective  function  is  zero  (which  indicates 
that  all  the  introduced  artificial  variable  have  vanished), 
I then  the  original  problem  is  feasible,  and  an  initial 

optimal  solution  can  be  directly  obtained  from  the  solution 
of  the  pseudo  optimization  problem  (c.f 


linear  programming  book) . 


• r 


any  standard 


I 

I 

f 


For  our  problem  of  seeking  an  optimal  solution  to 

(3.2.14)  with  constraints  (3.2.15)  through  (3.2.19),  we  will 
now  temporarily  neglect  (3.2.18)  and  (3.2.19),  and  add 

n + 1 + (2n  - 2)  = 2n  + n - 1 

artificial  variables  R-.  through  R _ to  constraints 

2 +n-l 

(3.2.15) ,  (3.2.16)  and  (3.2.17)  and  obtain  a minimization 
problem  in  the  following: 


subject  to 

(3.2.20) 

(3.2.21) 

(3.2.22) 


2n+n-l 

Minimize  £ R, 
k=l 

(3.2.18),  (3.2.19)  and 
(2D|-p,pjAT|0  11,0,0)  (W|x'  ly.'  |T|R) 

(pT|  0,0|  0|0|0,I,0)  (W|x'  ly.'  |T|R) 

( A|  0,0101-1 10,0,1)  (W)x’ )Z' )T)R) 


= 2Ds, 


= PW, 


= b. 


Except  for  the  constraint  (3.2.18),  the  constraints  are 

linear.  The  simplex  method  can  be  used  with  a slight 

modification  to  solve  this  problem.  First,  temporarily 

neglect  (3.2.18)  and  formulate  a linear  programming  problem. 

Then,  we  use  the  simplex  method  in  solving  this  problem  with 

the  additional  requirement  that  for  every  iteration  of 

generating  a new  basic  feasible  solution,  we  check  (3.2.18) 

to  be  sure  that  only  one  of  the  two  variables  y.'  and  T.  are 

3 3 

a basic  variable.  That  is,  if  one  of  the  two  variables  are 
already  a basic  variable,  then  the  other  variable  cannot  be 
the  variable  entering  the  basis  unless  the  other  variable  is 


L 
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the  vaciabie  leaving  the  basis.  This  guarantees  that  the 
solution  obtained  satisfies  (3.2.18). 

We  have  now  completely  solved  our  quadratic 
optimization  problem.  It  should  be  noted,  however,  that  the 
above  solution  procedure  involves  solving  a programming 
problem  with  a large  number  of  variables,  in  particular, 
when  the  number  of  priority  classes  is  large.  An 
alternative  solution  method  is  described  below  that  may  be 
simpler  than  using  the  above  algorithm.  This  solution 
method  is  particularly  useful  when  the  optimal  solution  lies 
in  the  interior  of  the  feasible  performance  space. 

In  the  following,  we  discuss  the  objective  function 


(3.2.1)  in  ge 

ner  al 

(m  not  restricted  to 

*' 

being 

equal  to 

two)  , 

and  describe 

an 

iterative 

method 

for 

solving 

the 

opt imization 

of 

(3.2.1) . 

This  iter 

at  ive 

method 

can  be 

simply  stated  as  follows: 


First,  solve  the  problem  of  opt 
the  constraints  relaxed  (temporar 
solution  to  the  relaxed  problem 
determine  if  this  solution  s 
constraints.  If  all  the  constraint 
this  is  the  optimal  solution  to 
However,  if  some  of  the  constraints 
violated  constraints  are  added  a 
again.  This  process  is  repeated  unt 
all  constraints  is  found. 


imization  with  some  of 
ily  neglected) . After  a 
is  obtained,  check  to 
atisfies  the  neglected 
s are  satisfied,  then 
the  original  problem, 
are  violated,  then  the 
nd  the  problem  is  solved 
il  a solution  satisfying 
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To  solve  our  optimization  problem  (3.2.1),  we  first 
relax  all  the  inequality  constraint  (2.1.4)  and  only  keep 
the  constraint  (1.3.2).  This  relaxed  constraint 
optimization  problem  can  be  solved  using  Lagrange's 
multiplier  method. 


n 


Let  L = Z \kck(E[Wk]-sk)m-  £(  Z pkE[Wk]-pW) 

k — 1 k— 1 


n 


Lagrange's  necessary  condition  for  optimality  is: 


(3.2.23)  It  = 0 ' and 
o * 


(3.2.24) 


3E[W1a 


= 0 , for  1 < k < n . 


Now,  for  each  i,  i=  1,  2,  ...,  n, 


Sl/dE[Wk]  = mXici(E[Wi]-si)m_1  - £Pi  . 


setting  them  equal  to  0,  we  obtain 

(3.2.25)  E [W  i)  =si+  ( £/m  ) 1/(m_1).  (l<i<n) 

The  first  Lagrange  condition  3L/S2  = 0 requires 
n 


(1.3.2)  e PkE[Wk]  = pW 


k=l 


Substitute  (3.2.25)  into  the  above  equation,  we  get 


n 


n 


. _ . , i . m—  1 , 1*  / . . m—  1 . — 

k-A5*  + ” k-iPk/ “k<=k  1 = ° 


Thus , 


/ L ) m~  i 


n 


(PW-  I Pksk)  / ( Z Pk  / (^kck)m-1) 
k — X k~ 1 
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We  therefore  have,  for  i = 1,  2,  n, 

1 1 


(3.2.26)  E[Wi]  = si+ 


= s 


i + (pW-kJ1‘,ksk)'/((“ici,'”‘1k^1'k/(>‘kck) 


1 

, m-1 


This  solution  can  now  be  substituted  into  (2.1.4)  to 
see  if  all  the  constraints  are  satisfied.  If  they  are,  then 
it  is  the  optimal  solution.  IF  not,  we  add  the  constraints 
that  are  violated  and  solve  the  more  restricted  mathematical 
programming  problem.  In  the  case  when  m=2,  we  have  a 
quadratic  programming  problem  as  above  with  less 
constraints.  If  m>2,  we  have  a convex  objective  function 
and  a set  of  linear  constraint.  There  are  various  ways  of 
solving  this  type  of  optimization  problem.  For  example,  see 
[TAHA76].  One  iteration  of  adding  constraints  should 
normally  solve  the  problem.  The  resulting  solution  will  be 
on  the  boundary  and  will  be  either  fixed  priority  or  MEFP 
scheduling . 

Now  let  us  consider  the  special  case  of  (3.1.2)  when 
all  s ^ = 0.  Using  (3.2.26)  as  the  solution  to  the  relaxed 
problem  (3.1.2)  with  constraint  (1.3.2)  only,  we  have 


(3.2.27)  E [Wx) 


= ( 


E[W2 

1 


E [Wn] 


^lcl 


m-1 


, m-1 


m-1 


^2C2 


c ' 
n 
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ECw.l 


Figure  3.2  A Two-Class  Queueing  Model 

Figure  3.2  shows  a two-class  queueing  model  with  cost 
function  defined  by  (3.2.1)  for  m=2  with  positive  s L and  s2. 
From  (3.2.26),  we  have 


E [W-l]  = 


= sn  + 


pW  — P ^ — ^2S2 


1 Pi  p2 

<|il=:L)  ‘:2=2) 


and  E (W21 


s2  + 


pW  — ^1^1*  P2S2 

P1  p2  \ 
(m'2C2)  V1c1+  ^2c2^ 


Thus,  if  W = (E[WiJ,  E [W2 ] ) lies  between  Q12  and  Q21, 
then  it  is  an  optimal  solution.  However,  if  it  does  not, 
then  we  need  to  use  the  quadratic  programming  solution 
procedure  that  we  described  earlier  to  obtain  a solution. 
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In  this  example,  the  point  S is  defined  as  (s-^,  s2). 
It  is  interesting  to  note  the  following  behavior  of  the 
objective  function:  at  the  point  S,  EtW^)  = s-,  and  E[W2J  = 
s2,  so  that  the  objective  value  is  0.  As  the  objective 
value  increases,  the  contour  of  all  points  with  the  same 
objective  value  are  elipses  centered  at  S.  We  note  that  the 
line  L:  PiElwil  + P2Etw2}  = W is  tangent  to  the  contour 
with  the  optimal  objective  value. 

3 . 3 Examples 

In  tnis  section,  we  illustrate  the  optimization 
techniques  that  were  described  in  Sections  3.1  and  3.2  with 
a series  of  numerical  examples  using  a 3-class  priority 
queueing  model  with  the  following  characteristics: 


X1  - 

2 

X2  = 18/5 

V3  = 

16/5 

E [S  x]  = 

1/10 

E[S2]  = 1/12 

e[s3]  = 

1/8 

VarlSj]  = 

1/240 

Var  [S  2]  = 1/180 

Var  [S  3]  = 

1/320 

E [S  f]  = 

17/1200 

E [S  |]  = 1/80 

E [S  3]  = 

3/160 

“1  = 

10 

^2  = 12 

P-3  = 

8 

P1  = 

0.2 

P2  = 0-3 

P3  = 

0.4 

And,  p = 9/10,  W = 2/3. 


The  constraints  (1 . 3 . 2)  and  (2.1.4)  are: 

(a)  0.2  ElWj]  + 0.3  E[W21  + 0.4  E [W  3]  * 3/5 
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(b.l) 

0.2 

ElW^ 

>_ 

1/60 

(b.  2 ) 

0.3 

e[w2] 

> 

1/35 

(b.3) 

0.4 

e[w3j 

> 

2/4  5 

(b.4) 

0.2 

EIW^ 

+ 0.3 

eiw2] 

1/15 

(b.  5) 

0.2 

e[wl] 

+ 0.4 

E [W  3] 

> 

1/10 

(b.6) 

0.3 

e[w2] 

+ 0.4 

e[w3] 

> 

7/45 

The  space  defined  by  the  above  constraints  is  shown  in 
Figure  3.3. 


Figure  3.3  A 3-Class  Priority  Queueing  Model 


We  now  give  the  examples. 

n 

Example  1.  Minimize  I x,c,E[W,  ] 


foe 


3 


5. 


C1  = 


c 2 = 4' 


c , = 


From  the  "pc"rule,  we  have 


^ ^c  ^ = 3 0 , 11 2C  2 = 4 ® * p.^C3=40. 

Since  n2c2  > ^3C3  > ^ici»  fc^e  °Ptimal  scheduling  is  a 
fixed  priority  scheduling  that  sets  class  2 as  highest 
priority  class,  class  3 the  next  highest,  and  class  1 the 
lowest  priority  class. 


Under  this  setting,  (3.1.3)  gives 
W = (20/9,  2/21,  20/63)  . 


Now,  if  we  rename  the  class  indices  such  that  l'=2, 
2'=3,  3'=1,  then  using  the  algorithm  of  Theorem  2.4,  we  get 

r • =0  and  r^  =0.  Substitute  r i =r 2 = e (a  verY  small 
quantity)  and  use  (2.2.10),  we  have 

or^=  1/ e , or 2 = 1/®  , “ 1. 

Let  G-l/e  , then  G is  a very  large  number,  and  we  have 

2 

Of ^ = 1,  <^2  _ G , a ^ ~ G. 

This  set  of  values  can  then  be  used-  in  escalating 
priority  scheduling  to  approximate  the  optimal  fixed 
priority  scheduling. 
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n 

Example  2.  Minimize  £ \ c E[W  ] 

k=l  k k k 

foe  c1=4,  c 2 = 3 , c 3 = 5 . 

In  this  case,  we  have 

HjO  2 =40,  ^2C  2 = 36,  ^3C  3 = ^0. 

Since  p,^c  ^ = p,  c y we  have  multiple  optimal  solutions. 
Among  the  extreme  points,  Q^-?2  an<^  ^312  are  optimal 
solutions  because  p,^c  ^ ^3C  3 * ^2C2  anc^  ^3C  3 — ^1°  1 > ^2C  2' 

The  convex  combination  of  these  two  points  are  the 
whole  line  segment  Q 13^  312'  Thus  all  points  lying  on  the 
line  segment  213^312  are  optimal.  From  (3.1.3)  we  have, 
Q 132  = (1/12,  5/3,  5/24)  and  Q312=  (5/18,  5/3,  1/9).  Thus, 
the  optimal  solutions  are  kQ132  + (l-k)Q312#  for  0£k£l , or 

W — ( (5/18) - (7k/36) , 5/3,  (1/9 ) + (7k/72 ) ) . 

One  set  of  a^’s  for  these  optimal  solutions  is 

= aG,  a 2 ~ 1 , a 3 = bG, 

where  G is  a very  large  number  and  a,  b are  small 
positive  quantities. 


Example  3.  Minimize  max  { \^c  . E [W ^]  ; k=l,2,3} 


for 


cl=  3, 


c 2 « 4, 


c3=  5. 
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We  first  find  a solution  to  the  relaxed  problem  with 
constraint  (a)  only.  From  (3.1.10),  we  have 

W = (24/19,  10/19,  9/19) . 

Since  all  the  constraints  (b.l)  through  (b,6)  are 
satisfied,  this  solution  is  optimal. 

Using  the  algorithm  of  Theorem  2.4,  we  get 

= 1,  c*2  =2.8707,  c*2  = 3.2448. 

Example  4.  Minimize  max  ^]CC]CE  * k=l,2,3} 

for  c^  = 1,  c2  = 10,  c 3 = 10. 

We  first  use  (3.1.10)  to  find  a solution  to  the  relaxed 
problem  with  constraint  (a)  only.  We  have 

W = (72/29,  4/29,  9/58) . 

Now,  when  we  check  the  constraints  (b.l)  through  (b.6), 
we  see  that  (b.6)  is  violated.  Therefore,  we  need  to  use 
the  two-step  procedure  of  solving  n linear  programming 
problems.  Note  that  in  this  case,  the  solution  must  lie  on 
the  boundary. 

According  to  the  solution  procedure,  we  have  the 
following  three  optimization  problems  (the  c constraints  are 
from  constraint  set  (3.1.5)): 
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(i)  Minimize  2 EtW^]  subject  to  (a),  (b.l)  ~ (b.6), 

and  (c.l):  2 ElWj  > 36  E[W2], 

(c . 2)  : 2 ElWjJ  > 32  E[W3]  . 

( i i ) Minimize  36  E[W2]  subject  to  (a),  (b.l)  ~ (b.6), 
and  (c.  3)  : 36  E[W2]  _>  2 E [W  -j^]  , 

(c  . 4 ) : 36  E [W  2]  > 32  E [W0]  . 

(iii)  Minimize  32  E[W3]  subject  to  (a),  (b.l)  ~ (b.6), 

and  (c . 5)  : 32  E[W3J  > 2 ElW^, 

(c  . 6)  : 32  E [W 3]  > 36  E [W 2]  . 

For  the  optimization  problem  (i),  we  find  that  the 
solution  space  is  empty.  So,  we  need  only  solve  (ii)  and 
(iii).  The  solution  spaces  for  these  two  optimization 
problems  are  shown  in  Figure  3.4. 


ElW|] 

Figure  3.4  Solution  Spaces  for  Example  4 
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The  optimal  solution  for  both  of  these  problems  are 


W = (20/9,  28/135,  7/30)  . 

This  is  then  the  optimal  solution  to  our  original 
problem. 

The  procedure  in  Theorem  2.4  gives 

= 1,  £*2  = 1 • 1887G , a ^ = G. 

Example  5.  Minimize  max  {x  c E[W  ];  k=l,2,3} 

K K K 

for  c i = 1,  c 2 = If  c 3 = 10. 

Again,  we  first  obtain  a solution  to  the  relaxed 
problem  with  constraint  (a)  only.  From  (3.1.10),  we  have 

W = (72/47,  40/47,  9/98) . 

This  solution  is  infeasible  because  constraint  (b.3)  is 
violated.  Thus,  we  need  to  solve  the  following  three 
optimization  problems: 

(i)  Minimize  2 E[W^]  subject  to  (a),  (b.l)  ~ (b.6), 

and  (d.l):  2 E [W  j]  > 3 .6  E [W2  ] , 

(d  . 2)  : 2 E [W1]  > 32  E [W3J  . 

(ii)  Minimize  3.6  E[W21  subject  to  (a),  (b.l)  ~ (b.6), 

and  (d.3):  3.6  B [W2 ] > 2 E[W1], 

(d . 4 ) : 3.6  E[W2]  > 32  E [W3 ] . 


For  problem  (iii)  , we  have  multiple  solutions  as  the 
whole  line  segment  connecting  (35/27,  80/81,  1/9)  and  (16/9, 
2/3,  1/9). 

Since  all  three  optimization  problems  have  che  same 
objective  value  32/9,  all  the  solutions  are  optimal  to  our 
original  problem.  That  is,  our  solution  is  the  line  segment 
connecting  (35/27,  80/81,  1/9)  and  (16/9,  2/3,  1/9). 

For  the  point  (16/9,  2/3,  1/9),  we  obtain 
ot  ^ = 1,  oi  2 ~ 4,  = G. 

For  the  point  (35/27,  80/81,  1/9),  we  obtain 
= 1 / a 2 ~ 1*4,  a 3 = G. 

Thus,  for  any  a,1.4<a<4,  using  escalating  priority 
scheduling  with  the  parameters  as 


will  approximate  the  desired  optimal  performance. 

3 2 

Example  6.  Minimize  E ^CjElW^] 

k—  1 

for  cl=  3'  c 2 = 4,  c 3 = 5. 

We  first  find  a solution  that  satisfy  constraint  (a) 
only.  From  (3.2.26),  we  have 


98 


It  can  be  seen  that  all  the  constraints  (b.l)  through 
(b.6)  are  satisfied.  Thus,  this  is  optimal  to  our  original 
problem. 

The  procedure  of  Theorem  2.5  gives 

= 1,  a 2 *7  908 , a3  = 6*2237. 

3 2 

Example  7.  Minimize  E \ c E [W J 

k=l  k K K 

for  c i = 1,  c 2 = 10,  c 3 = 1. 

We  first  use  (3.2.26)  to  obtain  a solution  to  the 
relaxed  problem  with  constraint  (a)  only.  We  have, 


W = (24/29,  2/29,  30/29). 

This  solution  is  infeasible  since  constraint  (b.2)  is 
violated . 

Thus,  we  need  to  use  Kuhn-Tucker  theorem  to  solve 
problem.  First,  we  add  only  the  violated  constraint 
to  constraint  (a) , and  solve  the  maximization  problem: 

3 2 

Maximize  - E ^P^lW^  * 
k=  1 

The  Kuhn-Tucker  conditions  are: 

-2x1c1E[W1l  - PjX  1 + p^  2 +p  iy-[  +01Y3  +Pi  Yg  = ~2Xj.Cj.Sj_, 

-2X2c^[W2)  ~ p ^ x ^ +P2X2  +p2y2  +p2y3  +p2  y6  = ~2X^2S2' 

-2x3c3E[W3J  +P2X2  +p3Y4  +p3Y5  + P3  y£  = -2X3c3s3, 


this 

(b.l) 
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= P w. 


I 


PiE  tWi ] + p2EtW2]  + P 3E [w3 ] 

P2E[W2]  > 2/35. 

Adding  a slack  variable  T to  the  last  inequality 
constraint,  then  add  through  to  each  of  the  above 

constraints,  by  using  artificial  variable  technique,  we 
obtain 


W = (40/49,  2/21,  50/49). 

This  solution  satisfies  all  the  constraints  (b.l) 
through  (b.6).  Thus,  this  is  the  desired  optimal  solution. 


For  the  values  of  c^,  we  obtain 


= 1.3043,  a 2 — G, 


a 2 ~ 1 • 


The  above  examples  illustrate  some  of  the  objective 
functions  that  can  be  used  in  escalating  priority 
scheduling.  In  particular,  note  that  different  objective 
functions  with  the  same  cost  coefficients  { c were  used  in 
examples  1,  3 and  6.  Comparing  Example  1 and  Example  6,  we 
see  that  when  we  change  the  cost  function  from  linear  to 
nonlinear,  the  optimal  solution  becomes  an  interior  point 
which  corresponds  to  an  escalating  priority  scheduling. 
However,  the  ordering  of  priority  is  not  changed.  Example  3 
is  a completely  different  cost  function  and  the  ordering  of 
priority  for  the  optimal  solution  is  different. 
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CHAPTER  4 


BEHAVIOR  OF  ESCALATING  PRIORITY  SCHEDULING 


4 . 0 Introduction 

In  this  chapter,  we  discuss  additional  model  behavior 
for  multiclass  M/G/l  queueing  systems  under  escalating 
priority  scheduling  and  make  comparisons  of  the  means  and 
the  variances  of  waiting  times  of  escalating 

priority  scheduling,  fixed  priority  scheduling  and  FCFS 
scheduling.  Without  loss  of  generality,  we  will  assume  that 


In  Section  4.1,  we  first  investigate  the  bounds  of  mean 
waiting  times  for  each  priority  class.  Then  we  derive 
bounds  on  the  ratios  of  mean  waiting  times  of  any  two 
consecutive  priority  classes,  and  discuss  the  sensitivity  of 
t.iese  ratios  to  the  variation  of  system  congestion.  In 
Section  4.2  we  discuss  mean  waiting  times  behavior  at 
saturated  congestion  and  show  that  the  ratios  of  any  two 
mean  waiting  times  are  inversely  proportional  to  the  ratios 
of  the  control  parameters  associated  with  them.  In  Section 
4.3,  we  investigate  the  variances  of  waiting  times  for  each 
class  of  joos  under  various  settings  of  the  parameters 


4 . 1 Behaviors  of  Mean  Waiting  Times 


In  this  section,  we  discuss  various  behaviors  of  mean 
waiting  times  in  escalating  priority  scheduling  systems.  We 
first  derive  bounds  of  mean  waiting  times  for  each  priority 
class.  Then  we  derive  bounds  on  the  ratios  E lw  i+  jl /E  [W 
given  a specific  setting  of  parameters  { } , and  discuss 

the  change  of  these  ratios  between  the  two  bounds  as  a 
function  of  system  congestion.  This  gives  the  sensitivity 
of  waiting  times  to  system  fluctuations. 

In  Chapter  2,  we  showed  that  for  any  adjustable 
priority  scheduling  rule,  mean  waiting  time  for  each 
priority  class  must  be  bounded  by  (2.1.1).  Since  escalating 
priority  scheduling  belongs  to  the  category  of  adjustable 
priority  scheduling  rules,  therefore  (2.1.1)  gives  bounds  of 
E [WjJ  for  each  class  i.  However,  these  bounds  are  derived 
under  the  assumption  that  each  class  can  be  assigned  any 
arbitrary  ordinal  of  priority.  We  will  now  study  the  bounds 
of  mean  waiting  times  given  that  a class  must  be  assigned  as 
the  highest  priority  class,  or  the  second  highest  priority 
class,  etc. 

These  bounds  are  given  by  the  following  theorem: 

THEOREM  4.1.  in  multiclass  M/G/l  queueing  systems  under 
escalating  priority  scheduling,  given 

0^2.  — °n ' 


we  have 


(4.1.1) 


(l-pJW/d-o^  < E[Wi]  <W/(l-uM). 

Proof : (a)  From  Theorem  2.4,  we  have 

ElV^]  < E [ W 2J  ...  < E[Wn]  . 

Thus,  for  l<k£ i , we  have 

»kEIV  i pkE|wil- 

i i i 

Therefore,  z pkE[Wkl  < £ P^lWj  = ( £ P^E^]  = aiE[Wi 

k=l  k=l  k=l 

Now,  from  Theorem  2.2,  we  nave 

i 

(1-0  )Woi/(l-cri)  < £ PkE[Wk]. 

k — 1 


Thus , 

(1-P )aiW/(l-ai) 

< E[Wi] 

or , 

(l-P)W/(l-ai)  < 

E[Wi]  . 

(b)  From  Theorem  2.4,  we  have 


E [W  ±]  < E [W  i+1]  < ...  < E [Wn  ] . 


Thus,  for  iXk<n,  we  have 


’kE|wi'  i »kElwk)- 


n n n 

Therefore,  £ p.EtW.J  = ( E Pk)  E [ w ±]  < £ P^lWjJ 

k=i  k=i  k=i 
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But,  from  Theorem  2.2,  we  have 


1 


n _ n 

E p.ElW.  ] < W(  E Pv)/U-Oi_i)- 
k=i  k K k=i  K 

n _ n 

Thus,  ( E o ) E lW  . 1 < W(  E p.)/(l-o.  \ 

k-i  * 1 k*i 

or,  E[Wi]  < W/(l-oi_1) . 


Figure  4.1  illustrates  these  bounds  of  mean  waiting 


times  for  a 5-class  queueing  model.  The  mean  waiting  times 


for  fixed  priority  scheduling  and  FCFS  scheduling  are 


represented  as  the  dotted  line  and  the  broken  line 


5-Class  Priority  Queueing  Model 

We  remark  that  in  order  that  a set  of  mean  waiting 
times  {E [W^ ] } be  feasible  using  escalating  priority 
scheduling,  they  not  only  need  to  lie  between  the  bounds  as 
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given  by  (4.1.1),  but  also  need  to  satisfy  the  following: 

(1.3.2)  P iE  IW11  + P2EIW2]  + ...  + PnElWn]  * PW, 
and , 

(2.2.2)  E [Wx  ] < E[W2]  < ...  < E[Wn]. 

Next,  we  derive  the  bounds  of  the  ratio  E [w  /E  [W  J 
for  each  class  i,  given  a specific  setting  of  the  parameters 
{ } . 

From  Theorem  2.4,  if  we  are  given  >.  “2  — • • * -°n  ' 

then  using  escalating  priority  scheduling,  we  have  E [ W 1 ] < 
E [W2  ] < ...  <E[Wn).  Thus,  the  lower  bound  of  E [Wi+ ^ ] /E  [W  i] 
is 

(4.1.2)  E[Wi+1]/ElWi]  > 1.  (l<i<n-l) 

The  upper  bound  of  E [W^+ /E  [W^]  can  be  obtained  from 
the  following  theorem: 

Theorem  4.2.  In  multiclass  M/G/l  queueing  systems  under 
escalating  priority  scheduling,  given 

<*2  — ***  - an'  we  have 

(4.1.3)  ffjElWj  > c*2E[W2)  > ...  > «nE  lWn]  - 

Proof:  From  Theorem  1.5,  we  have 
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E[WiJ 


W-  E (l-^)p  E[W  ] 
k=i+l  “i  k k 


i-1 

1-  E (1-— )p 
k=l 


V k 


Let  the  numerator  and  denominator  of  the  above  expres- 
sion be  and  respectively.  Then, 


n 


n 


N.  = W-  E P.E[W  ] + — E a p E[W  ] , 
1 k=i+l  K K “i  k=i+l  K K K 


n 


n 


Ni+1  = W-  E PkE[W]+- 

1+±  k=i+2  K K ai+l  k=i+2 


E 'VkE(wk1 


n 


n 


* w-  E c E[W  1 +—  E o,kOkE[W  ] . 
k=i+l  l+l  k=i+l 

i i Dk 

i k=l  i+1 


1 Pi 


= 1 - a.  + a.  . , E — , 

1+1  k=i  «k 


i-1  Pk 

D • , = 1 - a ■ i + a,  E — 
1+1  1+1  1 k=l  ak 


1 pk 

= 1 - a,  + a,  E — 
1 1 k=l  ak 


SO,  aiE[WiJ-ari+1E[Wi+1] 


a iNi  ai+lNi+l 


D . 
l 


'i+1 


°'iDi+lNi"0fi+lDiNi+l 

DiDi+l 
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The  numerator  ° iD  i+ iN  i-0  i+ 1D  iN  i+ 1 


i Pi 


* (ori(l“Oi)  + «i«i+1  ^ «k> 

x <w  - " , \E  'V  + 7;  * , “x  pkE  1 V > 


k=i+l 


n 

E 

1 k=i+l 
i P, 


- <«i+l(1“’i>  +»i“i+l  «k’ 


(W-  ” 0kE (W  J E o.kOkE[wkl) 

' K * ai+l  k=i+l 


k=  i+1 


n 


- J :+1pkE[V) 


1 pk,  n 


- ( = 7s)  < E “kpkE[wk1)} 

k=l  k k=i+l 


- " »kElwkU,1‘°i)  + 

k=i+l  J~x  J 


? PVE[W.]  [1-  E (l-^Pjl) 

k-i+1  3 = 1 J 


j-1 

k-1  o . h-1  ak 

X (1-  E 0-  -^>0,  + . E <1--)Pjl) 
j-1  “j  1 3=1+1  3 
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n 


- <“i-“i+i)f<1-<ii>w-k^+1|!)CE|wkIDk 


n 


k-1 


E p E[W  ] S (l--^)p  ] 
k=i+l  k k j=i+l  aj  3 


n 


N, 


= (1_ai)W“  2 Pk  D °k 

1 1+1  1 k=i+1  * Dk  K 


n k-1 

E 


a-L 


v . , . E , (1-~ 
k=i+l  ]=i+l  3 J 


n 


= (arauJfd-^JW-  ^ PkN 

i l+l  l k=i+l  * K 


n k-1 

E 


O', 


v . , . 1 <1-ir",j,'kE[wk1) 

k=i+2  3=1+1  3 J 


n 


= (a. -a..  ,)(  (l-c.)W-  E p (W  - E ( 1 - — 1 '-)  p . E [W  . ] 
1 1+1  1 k=i+l  k j=k+l  “k  3 3 


Of. 


n 

E 


n 


. . . „ ? 1<1-5T7)l,jl,kE[wkI) 

3=1+1  k=3-l  3 J 


n 

^ai-ai+l^  t (1-CTi)w  “ 

( 2 p )W 

k=i+l  k 

n 

n 

a • 

+ E 

E (1 

--1)  p p ,E[W.] 

k=i+l 

j=k+l 

ak  k 3 3 

n 

n 

av 

E 

j=i+l 

2 (1 
k=j-l 

-^)PjpkEtWk] 

n 


= («r“i4.i)((1'ci-  z PvW 
1 1+i  1 k=i+l  k 


= (1-P)W  • 

Thus,  aiE[Wi]-ai+1E[Wi+1]  = («i-«i+1)  (l-P)w/D±Di+1  ^0 
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From  (4.1.2)  and  (4.1.3),  we  have  the  following  bounds: 


(4.1.4)  1 < E [Wi+1l/E  [Wi]  < (l<i<n-l) 


Note  that  these  bounds  are  independent  of  the 
congestion  factor  p. 

We  next  examine  the  change  of  these  ratios  of  mean 
waiting  times  as  P increases  from  0 to  1 . We  will  assume 
that  the  arrivals  from  each  class  increase  proportionately 
so  that  X^X  is  fixed  for  all  i.  Under  this  condition, 

(4.1.5)  Pi  = P^P 

is  fixed. 


The  ratios  of  E [W  i+  /E  [W  i]  can  be  obtained  by  first 
examining  the  ratios  E[WiJ/W: 

From  (1.3.10),  we  have 


EIW^/W 


n 
Y 

k=i+l 


o. 


1-  Y (1-— )pk(E[Wk]/W) 

a i 


i-1  a ■ 

1-  Y (1  - -i)  Pk 
k=l  ak  K 


Substituting  (4.1.5)  into  the  above  equation,  we  obtain 


n 


(4.1.6) 


E[Wi)/W 


l-(  Y (1  )pk(E[Wk)/W))p 
k=  i+1  ai 


i-1  oi  i 

i-  ( S ( 1 “ “ ) Pk ) p 
k=l  a\ 
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which  is  a function  of  p only. 


Therefore,  given  fixed  ratios  of  arrivals  among 
different  priority  classes,  (4.1.6)  can  be  used  recursively 
(from  n backwards  to  1)  to  calculate  EtW^l/W  for  various 
congestion  rates. 

To  illuatrate  this,  we  consider  a 5-class  queueing 
model  with  the  following  characteristics:  E[S^]  = 1, 
E [S^ ] = 2,  xi=  X/5,  «i=  1/i,  and  pi  = 1/5,  for  l<i<5. 

Figure  4.2  depicts  the  change  of  each  E[Wi]/W  in  the 
range  0 < p < 1.  The  gradual  slopes  of  these  curves  show 
that  for  small  fluctuations  in  congestion,  the  ratios  of 
waiting  times  between  any  two  priority  classes  will  not 
change  much.  Even  over  a rather  large  range  of  P , the  mean 
waiting  time  ratios  do  not  have  a large  change. 

Since  W itself  is  an  increasing  function  of  P,  we  note 
that  as  p approaches  1,  each  E[W^]  will  approach  infinity. 
The  mean  waiting  times  E[W^]  and  W are  shown  in  Figure  4.3. 

For  comparison,  we  show  the  corresponding  measures  of 
the  same  system  operated  under  fixed  priority  scheduling  in 
Figures  4.4  and  4.5. 
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The  mean  waiting  times  behaviors  of  escalating  priority 
scheduling  in  Figures  4.2  and  4.3  show  that  as  system 
congestion  increases,  every  priority  class  has  a significant 
increase  in  mean  waiting  time  and  that  strong  couplings  can 
occur  between  priority  classes.  However,  in  fixed  priority 
scheduling,  increased  congestion  has  almost  no  effect  on  the 
mean  waiting  times  for  higher  priority  classes,  but  a large 
effect  on  the  lowest  priority  class  as  can  be  seen  in 
Figures  4.4  and  4.5. 

We  remark  that  the  strength  of  couplings  between 
priority  classes  in  escalating  priority  scheduling  depends 
on  the  values  of  the  parameters  { As  the  ratio 

increases,  the  strength  of  coupling  between  classes  i and 
i+1  decreases.  In  the  limiting  situation  when  /ai+ 1 00  > 

this  coupling  effect  disappears,  and  we  have  a MEFP 
scheduling.  This  mixed  priority  scheduling  rule  has 

characteristics  of  both  fixed  priority  and  escalating 
priority  scheduling:  It  is  adjustable  so  that  relative 
waiting  times  among  priority  classes  can  be  controlled  and 
it  can  be  operated  at  saturated  congestion  while  still 
giving  finite  mean  waiting  times  to  the  higher  priority 
classes.  This  property  of  MEFP  scheduling  rule  was  first 
observed  by  Kleinrock  [KLEI66]  . 
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4 . 2 Waiting  Times  Behavior  at  Saturated  Congestion 


In  this  section,  we  investigate  mean  waiting  times 
behavior  at  saturated  congestion  that  is,  P 1 1 but 

remains  strictly  less  than  one,  preserving  stability.  We 
first  study  the  mean  waiting  times  behavior  of  fixed 
priority  scheduling.  From  Theorem  1.3,  we  have 

(1.3.4)  E[Wi]pp=  (l-pJW/a-c^)  (l-o^  . (l<i<n) 

From  (1.3.1),  W is  defined  as 

(1.3.1)  W = ( S \ E[sJ])/2(l-p)  . 

k=l 

n 2 

Thus,  E[Wi]pp=  ( £ \kE[Sk])/2  (l-a^p)  (1-^)  . (l<i<n) 

1 

Therefore,  for  l<i<n-l, 

lim  E [W ±]  Fp  < ®, 
p-»l 

while  for  i=n,  we  have 

lim  E [wn]  Fp  = 00 . 

p->l 


Therefore,  in  fixed  priority  scheduling  with  saturated 

traffic,  higher  priority  classes  will  have  finite  mean 

waiting  times  and  the  lowest  priority  class  will  have  an 

unbounded  mean  waiting  time.  This  behavior  is  reflected  in 

Figure  4.4  where  lim  E[W.  ]/W  = 0 for  i<n  and  lim  E[Wj/W  > 

p-»l  p-»l  “ 

0. 
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Next,  we  examine  the  ratios  among  the  E[Wi]'s  under 


escalating  priority  scheduling.  These  ratios  can  be 
obtained  from  the  ratios  {E[W^]/W}  so  we  will  first  study 
the  behavior  of  these  ratios.  Practically,  E[W^]/W  is  a 
measure  of  the  degree  of  discrimination  applied  to  class  i 
by  the  underlying  scheduling  relative  to  an  indiscriminated 
treatment . 

The  following  theorem  gives  the  ratios  {E[W^]/W}  when 
escalating  priority  scheduling  is  operated  under  saturated 
congestion . 


THEOREM  4.3.  In  multiclass  M/G/l  queueing  systems  under 
escalating  priority  scheduling,  in  the  limiting 
situation  when  p ^ 1,  we  have 

n 

(4.2.1)  lim  o.E[Wi]/W  = 1/  E (pk/ak),  (l<i<n) 

p-»l  k=l 

which  is  a constant  independent  of  the  class. 

Proof : Given  a set  of  finite  cx^ 1 s and  p = p ^ + p2  + •••  + Pn  ~ 1 , 

we  let 

n pv 

d = 2 — . 

k=l  ak 

Then,  d is  a finite  constant. 

We  will  prove  (4.2.1)  by  induction  (from  n backwards 
to  1 ) . 
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First,  from  (1.3.10),  we  have 


B[Wn] 


lim  — ^ 
p-»l  W 


n— 1 a 

1/tl-vE,(1-5?)pk) 

k=l  k 


n-1  n-1  p 

= l/u-  F p +%  L -f] 
k=l  K n k=l  ak 


= l/U-  ^ Pk+an(d"«  )] 

k=l  K n an 
= 1/  (1-  p + andj 

= 1 / and  . 

a E [W  ] . 

Hence,  lim  — = — . 

p-»l  W a 

Now,  suppose  (4.2.1)  is  true  for  k=i+l,i+2, 
then,  again  from  (1.3.10), 


lim 

'-1  W 1 - 1^1  ( ! - — ) p 


k=l  k K 


l-(  E PkE[Wk]  /W)  +~(  E c*vPvE[Wv]  /W) 


k=i+l 


ai  k=i+l 
i-1  p 


k k k 


1 - E p.  + « . E — 
k=l  k 1 k=l  ak 


1-  ( £ 


n p. 


1 n pk 

i)  + ~(  ^ 4) 


k=i+l  akQ  ai  k=i+l 
n pv 

1 - o.  , + a.  (d-  E -*) 
1-1  1 k=i  ak 


1 n pk  1 n 
l-±(  E — )+-^d  E pk 
d k=i+l  °k  ai  k=i+l 


1-ct.  , + cy-d~a-(  £ — ) 


i-1  ' i i 


k=i  uk 


n 

E 

— ) + S PkJ 

k=i+l 

ak  k=i+l  * 

d - p . 

n pv 
-a.(  Z —) 

l 

1 k=i+l  °k 

Hence,  we  have 


n Pk 

(p-a  . ) + a .d  - or.  ( £ — ) 

111  k=i+l  ttk 

n pk 

1 - c.  + or-d  - «.(  E — ) 

11  1 k=i+l  ak 


or.EtW.]  L 

lim  z = 5 

p-»l  W 


This  completes  the  proof. 


As  an  immediate  consequence,  we  have  the  following: 

(4.2.2)  lira  E[W.l/E[W.l  = a,/ot-.  (l<i<n,  l<j<n) 

p-*l  1 3 J 

Stated  in  another  way,  we  have 

(4.2.3)  11a  E[W  l!E[W2] E(Wnl  - ^ ■ • • • ■ ■ 

p-»l  l z n 

This  result  attaches  a significant  meaning  to  the 
parameters  { They  are  "discrimination  factors"  that 

set  decisive  measure  to  the  ratios  of  mean  waiting  times 
among  the  priority  classes.  Under  saturated  congestion,  the 
reciprocals  of  the  relative  ratios  of  these  discrimination 
factors  are  the  relative  ratios  of  waiting  times  that  can  be 
expected . 

We  observe  the  following:  In  Section  3.2,  we  showed 
that  the  optimal  solution  to  the  objective  function 

n 2 

(3.2.29)  Minimize  £ 'kckE^Wk^ 

k=l 

with  only  the  constraint  (1.3.2)  must  satisfy 
(3.2.27)  E(W1]:ElW2]:...:E[Wn]  = : ^ : * * * : * 

If  we  now  examine  the  constraint  set  (2.1.4)  at 
saturated  congestion,  we  can  see  that  for  each  inequality 

E p.E[Wj  > (1- p)  ( E p,)W/(l-  E p.  ) , 
k€l  k k k€I  k€I  K 
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the  right  hand  side  vanishes  when  p-*l. 


Thus,  in  this  limiting  situation,  (2.1.4)  becomes 
ineffective  and  every  solution  satisfying  (1.3.2)  is 
optimal . 

Comparing  (3.2.27)  with  (4.2.3),  we  see  that  if  we  set 
(4.2.4)  (l<i<n) 

then  at  saturated  congestion,  an  escalating  priority 
scheduling  will  be  optimal  for  the  objective  function 
(3.2.29),  when  the  parameters  { ) are  set  equal  to  the 
values  in  (4.2.4).  Therefore,  when  the  system  congestion  is 
heavy,  (i.e.,  when  p is  close  to  1),  (4.2.4)  can  be  used  to 
give  an  approximate  optimal  solution  to  (3.2.29). 


4.3  Variance  of  Waiting  Times 


In  this  section,  we  investigate  the  variances  of 
waiting  times  for  escalating  priority  scheduling  and  compare 
them  with  the  variances  of  FCFS  and  fixed  priority 
scheduling . 


We  first  give  the  following  theorem  on  the  variances  of 
waiting  times  for  FCFS  and  fixed  priority  schedulings  (c.f., 
[TAKA64] ) . 
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THEOREM  4.4. 


In  multiclass  M/G/l  queueing  systems  with  p < 


(4.3.1) 


(4.3.2) 


1,  we  have  the  following: 

(a)  Under  FCFS  scheduling,  the  variance  of 
waiting  times  for  all  classes  is 

— 2 n 3 

Var  [W  FCFS1  - W + ( £ Xk  E[ik]  ) / 3 (1-p)  . 

k~*  1 

(b)  Under  fixed  priority  scheduling,  the 
variance  of  waiting  times  for  priority  class  i 
( 1 <.i  <.n ) is 


Var  lWi)Fp 


£ XVE[S^] 
k=l  K K 

3 (1-CTi_i) 2 (1-CTi) 


— ^ 2 

W(l-p)  ( £ E[S  ] ) 

k=l 


+ 


— 1 2 — 

W(l-p)  { £ X E[S  ] -W(  1-P ) j 

k=l  * * 

(1"ai-l)2(1_ai)2 


As  a corollary,  we  have 

(4.3.3)  VarlwilFP  < Vartw2]FP  < < VarfWn]FP* 

From  the  above  theorem,  we  note  the  following:  First, 

under  FCFS  scheduling,  we  have 
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(4.3.4) 


Var  [Wi.l_.___  / E [Wi]!L 

FCFS  FCFS 


* Vac  twilFcps  / w2 


= 1 + ( £ \k  E[s£]  )/  3 w2  (1-p) 

k — 1 


Thus,  the  coefficient  of  variation  (Cv)  of  the  waiting 
time  variable  wfcfs'  defined  as  the  square  root  of 
Var  [WFCFS] /E  [W^^cf  , is  greater  than  one. 


FCFS 


than  one. 


As  a reference,  the  coefficient  of  variation  of  an 
exponentially  distributed  random  variable  is  one.  Thus,  the 
density  function  of  the  waiting  time  variable  W„„r.c.  is 

f Lfb 

flatter  than  an  exponential  function,  and  has  a "longer 
tail" . 


Second,  for  fixed  priority  scheduling,  using  (4.3.2), 
the  variance  of  the  lowest  priority  class  is 


E X.  E[SH 
. k=l  K * 


(4.3.5)  Vac[Wn)Fp=  *=± -2— 

3(l-an_1)  (1-p) 


_ n- 1 2 

W £ XkE[sn 
k=l  * 

( 1— c ,)3 

n-1 


U-'n-l*' 


From  Theorem  1.3,  we  have 
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(1.3.6) 


\ 


E[W  I - W / (!-«„_!> 


Thus , 


(4.3.6)  Var  [WJ  pP  / E [wn]  FP 

n •»  n_1  2 
Z XkE[S^]  E XkE[Sk] 

= l + ^~ + *=± 

3W  (1-p)  W(l-an_x) 

> Var [WpcFS] /E [WFCFg] 

Therefore,  for  the  lowest  priority  class,  not  only  is 
Var [Wn] Fp  greater  than  Var lwFCFS] , but  the  coefficient  of 
variation  (which  reflects  the  relative  dispersion  of  waiting 
times)  is  also  greater  than  the  coefficient  of  variation  of 
waiting  when  FCFS  scheduling  is  used. 


Third,  for  the  highest  priority  class,  (4.3.2)  gives 


n 


kJ/kE[Sk>  W(l— Plk^lsp  w2  (1_P): 

- -*  / 1 v + T ^ 


(1-Pi)' 


(4.3.7)  Var[Wl]Fp=  3(1^1 


Thus,  Var[WFCps]  - Var  [W-jJpp 


n 3 n 2 

£ L E[S")  £ X E[S  ] 

-2  k=l  K K k=l  K 

= W + 


d-Pi)' 


3 ( 1—  p ) 


3 ( l~p  2 ) 


W(l-p)  (W(l-p)-\1E[S1l  J 


d-P^' 


> W2  + W(l-p)  [\  £ vk  E[Sk]  - X^S^]  } / (1-Pj/ 


k=l 
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Li 


> W2  - E[WlJ  {j  Xk  E[SkJ  } / (1-p^ 

= W - E t W L ] pp 

> 0. 

Therefore,  by  giving  top  preference  to  a class,  not 
only  will  the  mean  waiting  time  for  jobs  in  this  class 
decrease,  the  variance  of  waiting  will  also  decrease.  As  to 
the  coefficient  of  variation  of  waiting  times  for  this 
class,  no  general  conclusion  can  be  drawn. 

In  escalating  priority  scheduling,  we  know  that  the 
control  parameters  can  be  used  to  adjust  the  preferences 
given  to  each  class  and  the  stronger  the  discrimination 
applied,  the  more  dispersion  between  waiting  times  of 
different  priority  classes  there  is.  We  also  know  that  the 
mean  waiting  times  for  jobs  that  belong  to  the  higher 
priority  classes  will  decrease  as  the  preferences  given  to 
these  classes  increase.  Similarly,  for  lower  priority 
classes,  the  increase  of  discrimination  applied  to  them  will 
increase  their  mean  waiting  times.  It  is  desirable  to  also 
understand  how  changes  in  discriminations  affect  the 
variance  of  waiting  times  for  jobs  within  each  priority 
class . 

To  investigate  the  variance  behavior  of  escalating 
priority  scheduling,  a simulation  model  of  a 3-class 
priority  queueing  system  was  developed.  The  details  of  the 
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simulation  is  given  in  Appendix  A.  In  the  following,  we 
present  the  results  of  the  variance  behavior  from  the 
simulation  experiments. 

In  Figure  4.6,  the  theoretical  mean  waiting  times  of 
the  model  under  different  levels  of  discrimination  in 
escalating  priority  scheduling  are  shown.  For  this  3-class 
priority  scheuling  rodel,  we  have  two  degrees  of  freedom  to 
adjust  the  parameters.  Let  r 1 = c *2/°i  and  r2  =a^ot2  be  the 
ratios  of  the  control  parameters.  Recalling  that  we  have 


restricted  the  { 

} to  be 

such 

that  Za2— a3'  we 

therefore 

have  0<r.  <1  for 

1 — 

1 < i<2 . 

The 

three  surfaces 

above 

the 

feasible  values  of 

(ri ' r2} 

are 

the  mean  waiting 

times 

for 

the  three  different  classes.  The  waiting  times  performance 
corresponding  to  either  r^  being  zero  — which  represent  the 
behavior  of  MEFP  scheduling  — is  added  to  make  the  surfaces 
closed . 

For  this  model,  if  we  only  consider  the  parametric 
change  ( a2*  “3)  = (a/k,a,ak)  for  k:l-»0  (a  is  an  arbitrary 
positive  constant),  then  r-^  =r2=k  and  0<k<l.  The  mean 
waiting  times  under  this  change  of  discriminations  are  shown 
in  Figure  4.7. 
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It  is  interesting  to  note  that  in  Figure  4.7,  when  k 
decreases  below  approximately  0.5,  the  difference  between 
E[W^]  and  EtV^l  increases  faster  than  the  difference  between 
EtV^j  and  E[W^].  This  shows  that  when  k is  in  this  range, 
the  discrimination  between  EIW3]  and  E [W2 ] caused  by  the 
decreasing  of  /a^  is  stronger  than  the  discrimination 

between  EtW^l  and  E[W^]  caused  by  the  decreasing  of 
by  the  same  amount. 

In  Figure  4.8,  the  mean  waiting  times  of  the  simulation 
experiments  and  the  theoretical  mean  waiting  times  are 
compared.  The  solid  lines  represent  theoretical  waiting 
times  for  different  priority  classes,  broken  lines  represent 
simulated  waiting  times,  and  dotted  lines  represent  upper 
and  lower  90%  confidence  limits  of  the  simulated  waiting 
times.  It  is  seen  that  for  each  priority  class,  the 
confidence  interval  derived  from  the  simulation  experiment 
contains  the  theoretical  values  of  waiting  times. 


Figure  4.8  Point  Estimates  and  Confidence 

Intervals  of  Mean  Waiting  Times 

For  the  same  simulation  experiments.  Figure  4.9  gives 
the  variances  of  waiting  times  and  their  corresponding  90% 
confidence  intervals.  The  marks  "x"  are  the  theoretical 
values  of  variances  of  the  two  limiting  cases,  FCFS  and 
fixed  priority  scheduling.  Note  that  each  of  the  confidence 
intervals  at  the  two  limits  also  contain  the  theoretical 
values.  These  results  show  that  as  discriminations 
increase,  the  variance  for  each  priority  class  changes 
strictly  monotonically , and  it  either  decreases  or  increases 
depending  on  whether  the  particular  class  is  a high  priority 
or  a low  priority  class. 
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Figure  4.9  Variance  Behavior  of  Waiting  Times 


Figure  4.10  gives  the  coefficient  of  variation  (Cv) 

behavior  for  each  class.  We  note  that  each  curve  of  Cv[Wi] 

connecting  Cv[Wil  and  Cv[Wi]  is  approximately  linear. 

FCFS  FP 

Experiments  similar  to  the  above  were  conducted  for 
several  different  service  time  distributions.  In  all 
experiments,  the  variances  of  escalating  priority  scheduling 
were  either  strictly  increasing  or  strictly  decreasing 
between  the  variances  of  FCFS  and  fixed  priority  scheduling. 

Therefore,  we  conclude  that  we  can  determine  the  upper  and 
lower  bounds  of  the  variance  of  any  priority  class  in 
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escalating  priority  scheduling  from  the  variances  of  FCFS 
and  fixed  priority  scheduling  and  obtain  a "crude"  estimate 
of  its  variance  for  a given  parameter  setting  by  using  a 
coefficient  of  variation  estimate  from  curves  similar  to 
those  given  in  Figure  4.10. 


/N 

CvCW23 


/N 

CvCw,  ] 


Figure  4.10  Coefficient  of  Variation 
Behavior  of  Waiting  Times 

Summarizing  the  results  of  our  simulation  study  on  the 
variances  of  waiting  times  in  escalating  priority 
scheduling,  we  find  that  as  the  discriminations  increase, 
the  variance  of  each  class  changes  in  the  same  direction 
(increases  or  decreases)  as  the  mean  waiting  times,  and  has 
a value  between  the  variances  of  FCFS  and  fixed  priority 
scheduling  . 
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CHAPTER  5 


RELATED  TOPICS  AND  CONCLUSIONS 

5 . 0 Introduction 

In  the  previous  chapters,  we  have  investigated  in 
detail  the  behavior  of  M/G/l  queueing  systems  under 
escalating  priority  scheduling.  In  Section  5.1  of  this 
chapter,  we  discuss  and  compare  these  results  to  the  results 
of  three  other  adjustable  priority  scheduling  rules.  In 
Section  5.2,  we  discuss  the  implementation  of  escalating 
priority  scheduling  in  an  on-line  transaction  processing 
environment  and  some  of  its  implications.  In  Section  5.3, 
we  give  final  conclusions  of  this  dissertation  and  point  out 
several  possible  directions  for  future  research  on 
Transaction  Processing  Systems  and  priority  scheduling 
rules . 

5 . 1 Other  Adjustable  Priority  Scheduling  Rules 

In  this  section,  we  will  discuss  and  compare  results  of 
three  other  adjustable  priority  scheduling  rules  with 
escalating  priority  scheduling. 

Specifically,  we  will  discuss  the  adjustable  priority 
scheduling  rules  with  the  following  priority  index 
functions : 

(a)  q±(t)  = Pi(t-Ti)r,  ( P ±> 0 , r>0) 

(b)  qi(t)  = , ( V^O) 
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(c)  q i( t)  = vi  + (t- 


Each  of  the  above  priority  functions  is  the  priority 

index  assigned  to  a job  from  class  i which  arrived  to  the 

system  at  time  . Whenever  the  service  facility  is 

available,  the  job  with  the  highest  instantaneous  priority 

index  q.(t)  is  taken  into  service.  We  will  now  discuss  each 
1 

of  these  scheduling  rules  in  more  detail. 


(a) 

a job's 
power  of 
at  time 
arrived 


First,  we  consider  the  priority  discipline  in  which 
priority  increases  in  proportion  to  some  arbitrary 
its  elapsed  time  waiting  in  the  system.  That  is, 
t,  the  priority  associated  to  a job  in  class  i that 
at  time  is  assigned  a value 


q i ( t ) = 0i  ( t - t.  )r,  ( Bi  > 0) 


where  r is  a fixed  positive  constant  and  t ranges  from 
until  this  job  gets  the  service.  The  interaction  of 
priority  indices  between  jobs  of  different  priority  classes 
is  illustrated  in  figure  5.1. 


Figure  5.1  Interaction  Between  Two  Jobs 

From  Different  Priority  Classes 
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In  this  illustration,  a job  from  class  i arrives  at 
time  and  gains  priority  proportional  to  the  r-th  power  of 
the  time  it  spent  in  the  system.  At  time  Tj  , another  job 
from  a higher  priority  class  j enters  the  system,  and  gains 
its  priority  with  a larger  proportionality  constant  £3  ^ . 
Thus,  if  the  service  facility  becomes  available  between  t 
and  t*,  the  job  from  class  i will  be  taken  into  service. 
However,  if  the  facility  does  not  become  free  until  t*  or 
after,  then  the  job  from  class  j will  be  chosen  for  service. 

This  scheduling  rule  was  first  studied  by  Kleinrock  and 
Finkelstein  [KLEI67]  and  was  called  the  r-th  order  time 
dependent  scheduling.  The  following  theorem,  due  to 
Kleinrock  and  Finkelstein,  states  that  this  scheduling  rule 
is  equivalent  to  escalating  priority  scheduling.  Similar  to 
his  other  results  on  escalating  priority  scheduling,  this 
theorem  is  true  for  M/G/l  queueing  systems, 

THEOREM  5.1.  An  r-th  order  time  dependent  scheduling  with 
parameters  { is  equivalent  to  an  escalating 

priority  scheduling  with  parameters  { } when 

oi  = & ^ for  all  i,  l£i<n. 

From  the  above  theorem,  the  mean  waiting  time  for  each 
priority  class  under  r-th  order  time  dependent  scheduling 
can  be  obtained  from  (1.3.10)  as: 
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1-  I (1-  (g^)r)pk 
k=l  K 

The  equivalence  of  tne  two  scheduling  shown  in  Theorem 
5.1  reveals  that  they  have  the  same  behavior  and  the  same 
feasible  performance  space  of  mean  waiting  times. 


(b)  Next,  we  discuss  what  we  call  "deescalating 
priority  scheduling  discipline"  wh1HrS~'Ba'et*---giifctLs  priority 
decreases  linearly  as  it  waits  in  the  system.  Specifically, 
we  give  to  each  priority  class  i a negative  constant  Vj_  . At 
each  instant  t,  the  priority  index  q^(t)  of  a job  from  class 
i which  arrived  to  the  system  at  time  t.  is 

qjL(t)  ■ V±  ( t - T±  ) . ( yL  < o ) 


Figure  5.2  illustrates  the  interaction  between  two  jobs 
from  different  classes.  In  this  example,  at  time  t^,  a job 
from  class  i arrives  and  loses  priority  at  a rate  3^  . At 
time  Tj  , a different  job  from  a lower  priority  class  j 
arrives,  and  loses  its  priority  at  a faster  rate  y j-  If  the 
service  facility  becomes  free  between  and  t*,  the  job 
from  class  j will  be  served  first.  However,  if  the  service 


facility  does  not  become  free  until  after  t*,  then  the  job 
from  class  i will  be  served  prior  to  the  job  from  class  j. 


Figure  5.2  Interaction  Between  Priority  Functions 
for  Deescalating  Priority  Scheduling 

The  following  theorem,  due  to  Hsu  [HSU70]  , describes 
the  mean  waiting  times  behavior  of  jobs  from  each  priority 
class.  This  result  was  originally  proved  for  M/M/1  queueing 
sytems.  However,  it  holds  true  also  for  M/G/l  systems  using 
the  argument  presented  in  Section  1.3. 


THEOREM  5.2.  In  multiclass  M/G/l  queueing  systems  under 
nonpreemptive  deescalating  priority  scheduling, 
if  p < 1 and  we  are  given  a set  of  negative 
parameters  { y^}  such  that  Vj_  > V2  >.  ...  > Yn, 
the  mean  waiting  times  for  each  priority  class 
is: 


E[Wi] 


i-1 

(1-P)W+  £ (1- 

k=l 

n 

(1-P)+  ^ 

k=i+l 


-)pkEfV 
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The  limits  of  this  scheduling  are  given  in  the 
following  theorem: 

THEOREM  5.3.  (a)  When  we  set  the  V^'s  all  equal,  then 

deescalating  priority  scheduling  becomes  LCFS 
scheduling . 

(b)  In  the  limiting  situation  of  deescalating 
priority  scheduling  when  the  parameters  { } 

are  chosen  such  that  V^/Y^+1-*  U (l£i<n-l),  then 
we  have  a modified  fixed  priority  scheduling 
rule  such  that  within  each  priority  class,  a 
LCFS  rule  is  followed. 

For  comparisons  of  waiting  times,  we  give  the  following 
results  due  to  Wishart  [W1SH60]  and  Durr  [DURR69 ] . 


THEOREM  5.4.  In  multiclass  M/G/l  queueing  systems  with 
p < 1,  we  have  the  following: 

(a)  Under  LCFS  scheduling,  the  mean  and  variance 
of  waiting  times  are  given  by 


E ^ WLCFS 1 W 


var  [Wjjcfs  1 = (1+p)w2/(1-p)-M  I \re [S^]  ) /3  ( 1-p ) 2 . 


(b)  Under  modified  fixed  priority  scheduling  in 

which  an  LCFS  rule  is  followed  within  each 
priority  class,  we  have: 


1 


Lt 
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E[Wi]  * 


(l-o)w  / (l-Oi_1)  ( 1— CT  ^ ) . 


VarlWiJ  = — - — 


L KE[SW] 
k=l  K K 


(l-o)W(  Z X.E[S  ] ) 
k=l 


3(1"°i-l)2(1“Ci)  (l~ai_1) (l-ai)3 


(l-p)W(  Z X E[Sv]-(l-p)W] 
k=l  K K 

+ 2 2" 

(1-ai-l)  (1"ai) 

Comparing  the  above  formula  with  those  in  Theorems  1.2 
and  1.3,  we  see  that  mean  waiting  times  under  either  FCFS  or 
LCFS  scheduling  are  the  same.  Similarly,  mean  waiting  time 
for  each  priority  class  remains  the  same  no  matter  whether 
an  FCFS  or  an  LCFS  scheduling  is  followed  within  the  same 
priority  class.  However,  variances  under  LCFS  scheduling 
are  larger  than  that  under  FCFS  scheduling,  and  in  fixed 
priority  scheduling,  variance  of  waiting  times  for  each 
class  is  larger  if  LCFS  rule  is  used  instead  of  the  FCFS 
rule  within  the  same  priority  class. 


It  is  most  likely  that  an  algorithm  can  be  found  for 
determining  the  values  of  { } for  deescalating  priority 
scheduling  using  a method  similar  to  that  given  in  Theorem 
2.4  for  escalating  priority  scheduling.  We  note  that  the 
feasible  performance  space  of  mean  waiting  times  for  these 
two  scheduling  rules  are  identical.  However,  for  the  same 
performance  of  mean  waiting  times,  deescalating  priority 
scheduling  will  have  larger  variances  of  waiting  times. 
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(c)  We  now  discuss  the  scheduling  rule  for  the 
following  priority  function: 

q^t)  = >ii  + ( t - Ti  ) . 

In  this  scheduling  rule,  higher  priorities  are  given  to 
the  classes  with  larger  values  of  the  v^'s,  because  for  two 
joos  arriving  to  the  system  at  the  same  time,  the  one  from 
the  class  with  larger  value  of  will  have  a larger 
priority  index.  Also,  note  that  in  this  rule,  the 
parameters  { are  not  required  to  be  positive. 

Now,  if  the  quantity  ( Tj-vi)  is  the  due  date  of  a job 
that  arrives  to  the  system  at  time  , then  the  selection  of 
tne  job  with  the  largest  q^(t)  = v^+(t-T^)  = t-(T^-v)  at 
any  instant  of  time  t is  equivalent  to  selecting  the  job 
with  smallest  values  of  t^-v^,  i.e.,  the  job  with  the 
smallest  due  date.  Thus,  this  scheduling  is  called  the 
earliest  due  date  (EDD)  scheduling  when  the  v^'s  are  chosen 
in  this  way  and  we  will  restrict  ourselve  to  this 
interpretation  in  the  remaining  discussion  of  this  priority 
function . 

Figure  5.3  illustrates  the  priorities  of  two  jobs  from 
different  priority  classes.  It  is  noted  that  no  interaction 
of  priorities  exists  between  the  two  jobs  under  this 
scheduling  rule. 
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The  above 


conjecture , 


known 


as 


the  Jackson's 


Conjecture,  was  proved  by  Goldberg  IGOLD76]. 

It  is  interesting  to  note  the  similarity  of  this  result 
with  Theorem  4.3:  In  escalating  priority  scheduling,  when 
the  priority  of  a job  from  class  i is  VT  , we  have  the 
following : 

lim  (o-E  [Wi]  ) / (<*-iE  [W^]  ) = 1.  ( l<i<n , l<j<n) 

p-*l  1 3 

Thus,  Theorem  4.3  can  be  thought  of  as  the 
multiplicative  version  of  Jackson's  Conjecture  for 
escalating  priority  scheduling  systems. 

Next,  we  recall  from  Theorem  4.2  that  under  escalating 
priority  scheduling,  when  p<  1,  we  have: 

(4.1.3)  a-jEtW-jJ  > c*2E[W2]  > ...  > »nE[Wn). 

Similar  to  this  result,  Goldberg  proved  the  following: 
In  nonpreemptive  EDD  scheduling,  for  p < 1, 

vx  + ElW^  > v2  + EIW2]  > ...  > vn  + E [ W n ] . 

Goldberg  also  proved  that  the  limits  of  EDD  scheduling 
are  FCFS  and  fixed  priority  scheduling,  and  derived  the 
following  formula  for  mean  waiting  times  of  each  priority 
class : 


I 
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- 1 11  ' 1 1 — 1 ' " ■"  11  ■ ' mm*"  1 ' 

_ i-1  vk"vi  n rvi-vk 

E[Wi]  = W + £ pv\  Pc {Wi>t}dt  - £ Pv  \ Pr{W  >t}dt. 

k=l  k=i+l 

Unfortunately,  the  above  formula  can  not  be  directly  solved 
to  obtain  l given  a set  of  specified  and  feasible  mean 

waiting  times. 

We  conclude  from  the  above  discussions  that  adjustable 
priority  scheduling  rules  with  one  set  of  adjustable 
parameters  can  provide  several  different  types  of  waiting 
times  behaviors.  It  may  be  possible  to  obtain  the  same  mean 
waiting  times  for  the  different  priority  classes  using 
different  adjustable  scheduling  rules,  but  the  variances  of 
the  waiting  times  may  be  different.  To  obtain  control  of 
behaviors  of  both  the  mean  and  variance  of  waiting  times, 
scneduling  rules  with  more  than  one  set  of  parameters  may  be 
needed.  Thus,  for  instance,  if  we  use  a scheduling  rule 
with  priority  function  defined  as 

q i(t)  = vi  + °i  (t-Ti)  » 

then  the  behavior  of  waiting  times  may  be  able  to  be 
controlled  to  a greater  extent. 


5 . 2 Escalating  Priority  Scheduling  for  TP  Systems 

In  Chapter  1 of  this  dissertation,  we  pointed  out  that 
transaction  processing  systems  commonly  use  fixed  priority 
scheduling  with  the  result  that  transactions  from  lower 


times  while  transactions  from  higher  priority  classes  may 
have  response  times  that  are  faster  than  necessary.  This 
problem  motivated  us  to  study  escalating  priority  scheduling 
to  determine  if  it  would  be  appropriate  for  use  in  TPS.  In 
this  section,  we  will  discuss  the  use  of  escalating  priority 
scheduling  in  the  class  of  TPS  that  can  be  modeled  as 
multiclass  M/G/l  queueing  models. 

The  use  of  escalating  priority  scheduling  in  TPS  will 
allow  the  designer  of  TPS  to  select  a set  of  values  for  the 
control  parameters  { } to  provide  any  desired 
discrimination  between  the  transaction  classes  from  FCFS  to 
fixed  priority  scheduling.  This  capability  should  help 
eliminate  undesired  long  response  times  for  transactions 
from  low  priority  classes  when  transactions  from  high 
priority  classes  are  receiving  faster  response  times  than 
necessary.  The  algorithm  presented  in  Chapter  2 allows  the 
designer  to  obtain  a set  of  { cv^  } for  any  set  of  feasible 
expected  response  times  for  the  various  transaction  classes. 


If  a designer 
control  parameters 
to  optimize  the  cos 
expected  waiting 
times  can  be  obta 
algorithm  presented 
then  can  be  used  in 
required  values  o 


of  TPS  desires 
{ } of  escal 

t of  a TPS  using 
times,  a set  of 
ined  by  using 
in  Chapter  3. 
the  algorithm  i 
f { } . This 


to  select  the  set  of 
ating  priority  scheduling 
some  function  of  the 
feasible  expected  waiting 
one  of  the  optimizing 
This  set  of  waiting  times 
n Chapter  2 to  obtain  the 
procedure  requires  the 


designer  to  choose  an  appropriate  cost  function  and  to 
obtain  the  appropriate  unit  cost  estimates.  One  should  note 
that  this  optimization  procedure  does  not  take  into  account 
other  aspects  of  response  times  that  may  be  important  such 
as  the  variances  of  response  times  or  the  system  response 
under  saturated  congestion. 

When  the  transaction  workload  becomes  very  heavy,  one 
possible  disadvantage  of  using  escalating  priority 
scheduling  in  TPS  is  that  the  response  times  of  all  classes 
of  transactions  increase  together.  This  is  quite  different 
from  what  occurs  when  using  fixed  priority  scheduling  where 
only  the  low  priority  classes  response  times  increase  under 
a heavy  load  of  transactions.  This  can  be  avoided  if  the 
designer  of  the  TPS  uses  MEFP  scheduling.  However,  one 
significant  advantage  of  using  escalating  priority 
scheduling  is  that  for  small  changes  in  transaction  traffic, 
the  relative  response  times  for  different  transaction  classes 
stay  approximately  the  same. 

The  variances  of  response  times  are  important  in  many 
TPS  because  they  can  affect  the  productivity  of  the  human 
users.  Our  results  of  escalating  priority  scheduling  show 
that  the  variances  of  response  times  for  lower  priority 
classes  are  much  smaller  than  in  fixed  priority  scheduling. 
However,  the  variances  of  the  higher  priority  classes  will 
increase . 
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We  have  now  stated  that  escalating  priority  scheduling 
is  applicable  and  that  there  are  many  advantages  for  using 
it  in  TPS  instead  of  fixed  priority  scheduling.  However, 
any  scheduling  rule  used  in  TPS  must  require  only  a small 
amount  of  computation  time  for  each  arriving  transaction  and 
for  determining  the  next  transaction  for  service.  This  is 
because  the  transaction  traffic  is  large  and  the  processing 
time  required  for  each  transaction  is  usually  very  small. 
Therefore,  the  overhead  for  handling  each  transaction  must 
be  small. 

We  now  discuss  the  implementation  of  the  escalating 
priority  scheduling  rule.  First,  we  note  that  under  this 
scheduling  rule,  jobs  from  the  same  priority  class  will  be 
selected  strictly  according  to  their  arrival  times.  This  is 
because  jobs  from  the  same  class  have  the  same  priority 
increasing  rate.  Thus,  the  job  that  has  been  waiting  in  the 
system  for  the  longest  period  will  have  the  highest 
priority.  Therefore,  if  we  maintain  a separate  queue  for 
each  priority  class,  and  when  a job  arrives,  we  file  it  into 
the  end  of  the  queue  corresponding  to  its  priority  class, 
then  only  the  first  job  of  each  queue  can  qualify  as  the 
candidate  for  selection.  This  reduces  the  number  of 
possible  comparisons  in  selecting  the  next  job  for  service 
to  n,  the  number  of  different  priority  queues. 
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The  above  observation  also  has  the  following 
implication:  Even  though  the  priority  indices  of  the 
waiting  jobs  increase  dynamically,  there  is  no  need 
to  update  the  priority  indices  of  all  jobs  every  time  the 
server  becomes  available.  We  only  need  to  register  the 
arrival  time  for  each  job  when  it  enters  to  the  queue.  When 
the  server  is  ready  for  next  service,  we  then  take  measures 
of  the  elapsed  time  that  each  prospective  candidate  (the 
first  job  in  each  queue)  has  waited  in  the  queue,  and 
multiply  them  respectively  by  their  corresponding  priority 
increasing  rates  to  calculate  their  priority  indices.  The 
job  with  highest  priority  index  is  then  removed  from  the 
queue  and  is  put  into  service. 

This  shows  that  escalating  priority  scheduling  can  be 
efficiently  implemented  in  transaction  processing  systems 
with  very  little  time  required  for  overhead  information 
processing . 

5 . 3 Conclusions  and  Future  Research 

The  purpose  of  this  dissertation  was  to  investigate 
escalating  priority  scheduling  and  determine  if  it  is 
appropriate  for  use  in  TPS.  The  major  result  of  our 
investigation  of  escalating  priority  scheduling  was  the 
development  of  a two-stage  algorithm  for  determining  the 
values  of  the  control  parameters  to  optimize  various  cost 
functions.  This  two-stage  algorithm  allows  a designer  of 
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TPS  to  obtain  a set  of  control  parameters  for  escalating 
priority  scheduling  in  TPS  to  optimize  certain  types  of  cost 
functions  that  should  provide  better  response  times  to  users 
of  TPS  than  currently  being  provided  by  fixed  priority 
scheduling.  Thus  this  scheduling  is  applicable  to  TPS  and 
should  provide  better  utilization  of  the  system  resources. 

This  dynamic  priority  scheduling  rule  was  first 
proposed  by  Kleinrock  in  1964.  His  major  contribution,  in 
addition  to  proposing  the  rule,  was  in  developing  an 
algorithm  to  obtain  the  values  of  mean  waiting  times  given 
the  control  parameters  { } . The  major  new  results  for 
this  scheduling  rule  presented  in  this  dissertation  are: 
(1)  the  derivation  of  the  feasible  performance  space  for 
mean  waiting  times,  (2)  the  development  of  an  algorithm  to 
obtain  a set  of  { given  a set  of  feasible  mean  waiting 
times  {E[Wi]},  (3)  optimizing  algorithms  to  obtain  the 
{ E [ Wi ] } to  be  used  in  obtaining  the  { for  minimizing 
various  cost  functions,  and  (4)  proving  a multiplicative 
version  of  Jackson's  conjecture  in  saturated  congestion. 

Other  results  presented  for  the  escalating  priority 
scheduling  rule  are  that  the  variance  of  waiting  times  for 
each  class  either  strictly  increases  or  strictly  decreases 
between  the  variances  of  FCFS  and  fixed  priority  scheduling 
for  various  parametric  settings;  at  saturated  congestion 
the  mean  response  times  of  all  classes  are  extremely  large; 
and  the  relative  response  times  for  each  class  have  little 
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change  for  changes  in  the  arrival  rates. 


There  are  a number  of  interesting  problems  that  need  to 
be  studied  for  further  understanding  of  scheduling 
transaction  processing  systems.  The  question  that  arises 
first  is:  What  is  the  behavior  of  TPS  using  escalating 
priority  scheduling  when  there  is  only  a small  number  of 
users  instead  of  a large  number?  This  requires 
investigation  of  the  multiclass  M/G/l  queueing  model  with  a 
finite  arrival  population  under  escalating  priority 
scheduling.  This  may  be  difficult  because  the  arrival  rate 
changes  as  the  number  of  transactions  that  are  waiting  for  or 
being  processed  increases  or  decreases. 

The  second  question  that  arises  is:  What  happens  in 
escalating  priority  scheduling  if  the  arrivals  are  not 
Poisson?  This  could  occur  if  the  number  of  transaction 
arrivals  for  a given  class  was  not  large. 

A third  area  for  research  is  to  investigate  and  compare 
other  adjustable  scheduling  rules.  Of  particular  interest 
is:  What  are  the  relationships  between  the  mean  and 
variance  of  different  adjustable  scheduling  rules?  Can  the 
same  mean  response  times  be  achieved  for  different 
scheduling  rules  but  with  different  variances? 

Another  large  area  for  research  is  when 
multiprogramming  is  allowed  in  TPS.  In  this  case,  queueing 
network  models  must  be  used.  Here,  the  various  arrival 


processes,  service  time  distributions  and  different 
scheduling  rules  with  optimization  need  to  be  researched. 

Finally,  it  would  be  valuable  to  collect  some  "live 
data"  on  various  systems  for  analysis  and  to  perform  various 
types  of  experiments  using  different  scheduling  rules.  Some 
measurements  of  interest  are  the  arrival  processes,  the 
service  time  requests,  waiting  time  distributions,  and  mean 
response  times.  This  would  increase  the  understanding  of 


the  behavior  of  transaction  processing  systems. 
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APPENDIX  A 


A SIMULATION  MODEL  FOR  ESCALATING 
PRIORITY  SCHEDULING 

In  order  to  study  the  variance  behavior  of  waiting  times 
in  m/g/1  queueing  systems  under  escalating  priority  scheduling, 
a simulation  model  using  SIMSCRIPT  II. 5 was  built.  This  model 
is  a 3-class  priority  queueing  model  with  each  of  the  three 
priority  classes  having  the  same  common  characteristics; 

\ =0.5,  n^  = 2.5  and  p^=0.2. 

Four  sets  of  experiments  were  conducted,  each  using  a 
different  service  time  distribution.  The  same  service  time 
distribution  was  used  for  each  of  the  three  classes  of  trans- 
actions in  each  of  the  sets  of  experiments  to  hopefully  have 
the  results  reflect  only  the  variance  behavior.  The  service 
time  distributions  used  were  the  following-. 

Distributions  Coefficient  of  Variations 

Constant  0 

Erlang-4  0.5 

Exponential  1.0 

Hyperexponential  2 . 0 

The  values  of  the  control  parameters  were  determined 

2 

using  {ot a 2>  ot ^ = (a,ak,ak  ) with  k decreasing  from  one 
towards  zero  and  a > 0 . This  resulted  in  the  relative  dis- 
criminations between  classes  1 and  2 and  classes  2 and  3 
being  the  same  for  each  setting  of  k because  ^ = 

a3^0l2  = ' <I^ie  sPecific  set  of  values  of  k used  for  each 
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set  of  experiments  were  k=l,  4/5,  2/3,  1/2,  1/3,  l/5,  l/lO, 
and  1x10  . The  setting  k equals  one  corresponds  to 

f irst-come-f irst-serve  scheduling  and  the  setting  k equals 
1x10  '*'0  approximates  fixed  priority  scheduling. 

The  regenerative  method  of  analysis  was  used  to  collect 
and  analyze  the  simulation  data  [SARG76] . The  regenerative 
point  selected  was  the  beginning  of  a busy  cycle.  The  num- 
ber of  cycles  used  were  2,500  for  the  set  of  experiments 
having  constant  service  time  distributions  and  3,000  for 
the  other  service  time  distributions.  Data  were  collected 
on  the  first  and  second  moments  of  the  waiting  times. 

For  each  service  time  distribution,  using  the  sample 
mean  of  waiting  times,  we  constructed  a 90%  confidence  inter- 
val of  mean  waiting  times.  In  all  cases,  the  theoretical 
values  were  contained  in  the  confidence  intervals.  This 
validated  the  model.  The  data  for  the  set  of  experiments 
using  Erlang-4  service  time  distributions  are  shown  in 
Figure  4.8. 

Using  the  sample  second  moments  of  waiting  times  and 
theoretical  values  of  mean  waiting  times,  we  calculated  the 
point  estimates  and  confidence  intervals  for  the  variances 
of  waiting  times.  For  the  cases  when  k=l  and  k=lxl0~^°, 
the  theoretical  values  of  variances  were  obtained  from 
(4.3.1)  and  (4.3.2),  respectively.  Again  the  90%  confidence 
intervals  contain  the  theoretical  values.  This  provided 
additional  evidence  that  the  model  is  valid.  The  data  for 
the  set  of  experiments  using  Erlang-4  service  time  distribu- 
tions are  given  in  Figure  4.9. 
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The  point  estimates  and  confidence  intervals  of  the 
coefficient  of  variance  (Cv[W^])  were  calculated  for  all 
experiments  using  the  point  estimates  of  the  means  and 
variances  of  the  waiting  times.  To  calculate  the  confi- 
dence intervals  of  CvfVr]  , we  used  the  following  argument, 
assuming  independence,  from  probability  theory: 

Given  Pr  { a < x < b]  = P%  and  Pr  (c  < y < d)  = , 

then  Pr  { a/d  < x/y < b/c]  < PQ%  . 

Using  this  argument  and  the  estimates  of  the  first  and 
second  moments,  81%  confidence  intervals  were  constructed 
for  all  of  our  experiments.  Figure  4.10  contains  the  point 
and  intervals  estimates  for  the  set  of  experiments  using 
the  Erlang-4  service  time  distributions. 

The  results  of  the  four  set  of  experiments  were  very 
similar  except  that  the  variances  of  the  waiting  times 
increased  as  the  coefficient  of  variations  of  service  times 
increased  from  zero  to  two  as  was  expected.  The  means  and 
variances  of  waiting  times  for  each  class  of  transactions 
strictly  increased  or  decreased  as  k decreased  from  one  to 
zero,  depending  on  whether  it  was  a high  or  low  priority 
class.  Furthermore,  the  coefficients  of  variations  were 
similar  for  each  set  of  experiments  in  that  the  coefficient 
of  variations  remained  nearly  constant  for  each  priority 
class  as  k decreased  from  one  to  zero. 
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